These notes are an elaboration on: (i) a short course that I gave at the IPhT-Saclay in May-June 2012; (ii) a previous letter [Dav11] on reversibility in quantum mechanics.

They present an introductory, but hopefully coherent, view of the main formalizations
of quantum mechanics, of their interrelations and of their common physical underpinnings:
causality, reversibility and locality/separability. The approaches covered are mainly: (i) the
canonical formalism; (ii) the algebraic formalism; (iii) the quantum logic formulation. Other
subjects (quantum information approaches, quantum correlations, contextuality and non-locality
issues, quantum measurements, interpretations and alternate theories, quantum gravity) are
only very briefly and superficially discussed.

Most of the material is not new, but is presented in an original, homogeneous and hopefully
not technical or abstract way. I try to define simply all the mathematical concepts used and to
justify them physically. These notes should be accessible to young physicists (graduate level)
with a good knowledge of the standard formalism of quantum mechanics, and some interest
in theoretical physics (and mathematics).

These notes do not cover the historical and philosophical aspects of quantum physics. 
Chapter 1

Introduction 



1.1 Motivation 

Quantum mechanics in its modern form is now more than 80 years old. It is probably the
most successful physical theory that was ever proposed. It started as an attempt to understand
the structure of the atom and the interactions of matter and light at the atomic scale, and it
quickly became the general physical framework, valid from the presently accessible high energy
scales (10 TeV $\simeq 10^{-19}$ m), and possibly from the Planck scale ($10^{-35}$ m), up to macroscopic
scales (from $\ell \simeq 1$ nm up to $\ell \simeq 10^{5}$ m, depending on the physical systems and experiments).
Beyond these scales, classical mechanics takes over as an effective theory, valid when quantum
interferences and non-local correlation effects can be neglected.

Quantum mechanics has fully revolutionized physics (as a whole, from particle and nuclear
physics to atomic and molecular physics, optics, condensed matter physics and material
science), chemistry (again as a whole), astrophysics, etc., with a big impact on mathematics
and of course a huge impact on modern technology: the whole of communication technology,
computers, energy, weaponry (unfortunately), etc. In all these domains, and despite the huge
experimental and technical progress of the last decades, quantum mechanics has never been
seriously challenged by experiments, and its mathematical foundations are very solid.

Quantum information has become an important and very active field (both theoretically and
experimentally) in the last decades. It has enriched our points of view on the quantum the- 
ory, and on its applications (quantum computing). Quantum information, together with the 
experimental tests of quantum mechanics, the theoretical advances in quantum gravity and 
cosmology, the slow diffusion of the concepts from quantum theory in the general public, etc. 
have led to a revival of the discussions about the principles of quantum mechanics and its 
seemingly paradoxical aspects. 

Thus one sometimes gets the feeling that quantum mechanics is both: (i) the unchallenged 
and dominant paradigm of modern physical sciences and technologies, (ii) still (often pre- 
sented as) mysterious and poorly understood, and waiting for some revolution. 

These lecture notes present a brief and introductory (but hopefully coherent) view of the 
main formalizations of quantum mechanics (and of its version compatible with special relativ- 
ity, quantum field theory), of their interrelations and of their theoretical foundation. 

The "standard" formulation of quantum mechanics (involving the Hilbert space of pure 
states, self-adjoint operators as physical observables, and the probabilistic interpretation given 
by the Born rule), and the path integral and functional integral representations of probability
amplitudes are the standard tools used in most applications of quantum theory in physics and
chemistry. It is important to be aware that there are other formulations of quantum mechanics, 
i.e. other representations (in the mathematical sense) of quantum mechanics, which allow a 
better comprehension and justification of the quantum theory. This course will focus on two of
them, algebraic QM and the so-called "quantum logic" approach, that I find the most interesting
and that I think I managed to understand (somehow...). I shall insist on the algebraic aspects of
the quantum formalism.

In my opinion discussing and comparing the various formulations is useful in order to get 
a better understanding of the coherence and the strength of the quantum formalism. This is im- 
portant when discussing which features of quantum mechanics are basic principles and which 
ones are just natural consequences of the former. Indeed this depends on the different formu- 
lations. For instance the Born rule or the projection postulate are postulates in the standard 
formulation, while in some other formulations they are mere consequences of the postulates. 
This is also important for understanding the relation between quantum physics and special 
relativity through their common roots, causality, locality and reversibility. 

Comparing the different formulations is also useful for addressing these issues, in particular when
considering the relations between quantum theory, information theory and quantum gravity.

These notes started from: (i) a spin-off of more standard lecture notes for a master course
in quantum field theory and its applications to statistical physics, (ii) a growing interest^1 in
understanding what was going on in the fields of quantum information, of quantum
measurements and of the foundational studies of the quantum formalism, (iii) a course that I was
kindly asked to give at the Institut de Physique Théorique (my lab) and at the graduate school
of physics of the Paris Area (ED107) in May-June 2012, (iv) a short Letter [Dav11] about
reversibility in quantum mechanics that I published last year. These notes can be considered
partly as a very extended version of this letter.

1.2 Organization 

After this introductory section, the second section is a reminder of the basic concepts of clas- 
sical physics, of probabilities and of the standard (canonical) and path integral formulations of 
quantum physics. I tried to introduce in a consistent way the important classical concepts of 
states, observables and probabilities, which are of course crucial in the formulations of quan- 
tum mechanics. I discuss in particular the concept of quantum probabilities and the issue of 
reversibility in quantum mechanics in the last subsection. 

The third section is devoted to a presentation and a discussion of the algebraic formulation 
of quantum mechanics and of quantum field theory, based on operator algebras. Several as- 
pects of the discussion are original. Firstly I justify the appearance of abstract C*-algebras of 
observables using arguments based on causality and reversibility. In particular the existence 
of a *-involution (corresponding to conjugation) is argued to follow from the assumption of
reversibility for the quantum probabilities. Secondly, the formulation is based on real algebras, 
not complex ones as usually done, and I explain why this is more natural. I give the math- 
ematical references which justify that the GNS theorem, which ensures that complex abstract 
C*-algebras are always representable as algebras of operators on a Hilbert space, is also valid
for real algebras. The standard physical arguments for the use of complex algebras are only 
given after the general construction. The rest of the presentation is shorter and quite standard. 

The fourth section is devoted to one of the formulations of the so-called quantum logic 
formalism. This formalism is much less popular outside the community interested in the foun- 
dational basis of quantum mechanics, and in mathematics, but deserves to be better known. 
Indeed, it provides a convincing justification of the algebraic structure of quantum mechanics,
which for an important part is still postulated in the algebraic formalism. Again, if the
global content is not original, I try to present the quantum logic formalism in the same light
as the algebraic formalism, pointing out which aspects are linked to causality, which ones
to reversibility, and which ones to locality and separability. This way of presenting the quantum
logic formalism is original, I think. Finally, I discuss in much more detail than is usually done
Gleason's theorem, a very important theorem of Hilbert space geometry and operator algebras,
which justifies the Born rule and is also very important when discussing hidden variable
theories.

1. A standard syndrome for the physicist over 50... encouraged (for useful purpose) by the European Research
Council

The final section contains short, introductory and more standard discussions of some other 
questions about the quantum formalism. I present some recent approaches based on quan- 
tum information. I discuss some features of quantum correlations: entanglement, entropic 
inequalities, the Tsirelson bound for bipartite systems. The problems with hidden variables,
contextuality and non-locality are reviewed. Some very basic features of quantum measurements
are recalled. Then I stress the difference between 

- the various formalizations (representations) of quantum mechanics; 

- the various possible interpretations of this formalism; 

I finish this section with a few very standard remarks on the problem of quantum gravity. 

1.3 What this course is not! 

These notes are (tentatively) aimed at a non-specialized audience: graduate students and
more advanced researchers. The mathematical formalism is the main subject of the course, but 
it will be presented and discussed at a not too abstract, rigorous or advanced level. Therefore 
these notes do not intend to be: 

- a real course of mathematics or of mathematical physics; 

- a real physics course on high energy quantum physics, on atomic physics and quantum
optics, or on quantum condensed matter, discussing the physics of specific systems and their
applications;

- a course on what is not quantum mechanics; 

- a course on the history of quantum physics; 

- a course on the present sociology of quantum physics; 

- a course on the philosophical and epistemological aspects of quantum physics. 

But I hope that it could be useful as an introduction to these topics. Please keep in mind that 
this is not a course made by a specialist, it is rather a course made by an amateur, for amateurs! 

1.4 Acknowledgements 

I thank Roger Balian, Michel Bauer, Marie-Claude David, Kirone Mallick and Vincent Pasquier 
for their interest and their advice.
Chapter 2

Reminders 



I start with reminders of classical mechanics, probabilities and standard quantum
mechanics. This is mostly very standard material, taken from notes of my graduate courses
(Master level) in Quantum Field Theory. The sections on classical and quantum probabilities are a
bit more original.



2.1 Classical mechanics 

The standard books on classical mechanics are the books by Landau & Lifshitz [LL76] and
the book by V. I. Arnold [AVW89].



2.1.1 Lagrangian formulation 

Consider the simplest system: a non-relativistic particle of mass $m$ in a one-dimensional
space (a line). Its coordinate (position) is denoted $q$. It is subject to a conservative force
which derives from a potential $V(q)$. The potential is independent of time. The velocity is
$\dot q(t) = \frac{dq}{dt}$. The dynamics of the particle is given by Newton's equation

$$ m\,\ddot q(t) = -\frac{\partial}{\partial q} V(q) \tag{2.1.1} $$

The equation of motion derives from the least action principle. The classical trajectories
extremize the action $S$

$$ S[q] = \int_{t_i}^{t_f} dt\, L(q(t),\dot q(t))\ ,\qquad L(q,\dot q) = \frac{m}{2}\,\dot q^2 - V(q) \tag{2.1.2} $$

$L$ is the Lagrangian. Under the variations the initial and final positions are kept fixed, $q(t_i) = q_i$,
$q(t_f) = q_f$. So one requires that a classical solution $q_c(t)$ satisfies

$$ q(t) = q_c(t) + \delta q(t)\ ,\quad \delta q(t_i) = \delta q(t_f) = 0 \quad\Longrightarrow\quad S[q] = S[q_c] + O(\delta q^2) \tag{2.1.3} $$

This functional derivative equation leads to the Euler-Lagrange equation

$$ \frac{\delta S[q]}{\delta q(t)} = 0 \quad\Longleftrightarrow\quad \frac{d}{dt}\,\frac{\partial L(q,\dot q)}{\partial \dot q(t)} = \frac{\partial L(q,\dot q)}{\partial q(t)} \tag{2.1.4} $$

which leads to (2.1.1). This generalizes to many systems: higher dimensional space, many
particles, systems with internal degrees of freedom, time-dependent potentials, fields, as long as
there is no irreversibility (dissipation).

A good understanding of the origin of the least action principle in classical mechanics comes 
in fact from the path integral formulation of quantum mechanics. 
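
As a small numerical illustration of the least action principle (a sketch under stated assumptions, not part of the original course), one can discretize the action (2.1.2) and compare the classical path with nearby paths having the same end points. The harmonic potential $V(q) = k q^2/2$, the midpoint discretization, the parameter values and all function names below are choices made here for illustration only. For this potential and a time interval shorter than half a period, the classical path is a true minimum of the action.

```python
import math

def action(path, dt, m=1.0, k=1.0):
    """Discretized action: sum over segments of dt * [m v^2 / 2 - k q^2 / 2]."""
    total = 0.0
    for a, b in zip(path[:-1], path[1:]):
        v = (b - a) / dt           # velocity on the segment
        q_mid = 0.5 * (a + b)      # position at the segment midpoint
        total += dt * (0.5 * m * v * v - 0.5 * k * q_mid * q_mid)
    return total

def classical_path(q_i, q_f, t_f, n, m=1.0, k=1.0):
    """Solution of m q'' = -k q with q(0) = q_i, q(t_f) = q_f, sampled at n + 1 times."""
    w = math.sqrt(k / m)
    c = (q_f - q_i * math.cos(w * t_f)) / math.sin(w * t_f)
    return [q_i * math.cos(w * t_f * j / n) + c * math.sin(w * t_f * j / n)
            for j in range(n + 1)]

def perturbed(path, eps):
    """Add eps * sin(pi * j / n): a variation that vanishes (up to rounding) at both ends."""
    n = len(path) - 1
    return [q + eps * math.sin(math.pi * j / n) for j, q in enumerate(path)]

n, t_f = 200, 1.0
q_c = classical_path(0.0, 1.0, t_f, n)
s_c = action(q_c, t_f / n)
for eps in (-0.1, 0.1):
    # Difference S[q_c + delta q] - S[q_c]; it is of second order in eps.
    print(eps, action(perturbed(q_c, eps), t_f / n) - s_c)
```

The first order variation cancels (up to the discretization error), so the printed differences scale like the square of the amplitude of the variation, as stated in (2.1.3).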
Figure 2.1: The least action principle: the classical trajectory (full line) extremizes the action.
Under a variation (dashed line) $\delta S = 0$. The initial and final positions are kept fixed.



2.1.2 Hamiltonian formulation 

2.1.2.a - Phase space and Hamiltonian: 

The Hamiltonian formulation is in general equivalent to, but slightly more general than, the
Lagrangian formulation. For a classical system with $n$ degrees of freedom, a state of the system
is a point $x$ in the phase space $\Omega$ of the system. $\Omega$ is a manifold with even dimension $2n$. The
evolution equations are flow equations (first order differential equations in time) in the phase
space.

For the particle in dimension $d = 1$ in a potential there is one degree of freedom, $n = 1$ and
$\dim(\Omega) = 2$. The two coordinates in phase space are the position $q$ and the momentum $p$.

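$$ x = (q, p) \tag{2.1.5} $$

The Hamiltonian is

$$ H(q,p) = \frac{p^2}{2m} + V(q) \tag{2.1.6} $$

The equations of motion are the Hamilton equations

$$ \dot p = -\frac{\partial H}{\partial q} = -\frac{\partial V}{\partial q}\ ,\qquad \dot q = \frac{\partial H}{\partial p} = \frac{p}{m} \tag{2.1.7} $$

so the relation between the momentum and the velocity, $p = m\dot q$, is a dynamical relation. The
Hamilton equations also derive from a variational principle. To find the classical trajectory
such that $q(t_1) = q_1$, $q(t_2) = q_2$ one extremizes the action functional $S_H$

$$ S_H[q,p] = \int_{t_1}^{t_2} dt\, \big[\, p(t)\,\dot q(t) - H(q(t),p(t)) \,\big] \tag{2.1.8} $$

with respect to variations of $q(t)$ and of $p(t)$, $q(t)$ being fixed at the initial and final times $t = t_1$
and $t_2$, but $p(t)$ being free at $t = t_1$ and $t_2$. Indeed, the functional derivatives of $S_H$ are

$$ \frac{\delta S_H}{\delta q(t)} = -\dot p(t) - \frac{\partial H}{\partial q}(q(t),p(t))\ ,\qquad \frac{\delta S_H}{\delta p(t)} = \dot q(t) - \frac{\partial H}{\partial p}(q(t),p(t)) \tag{2.1.9} $$

The change of variables $(q,\dot q) \to (q,p)$ and of action functionals $S[q,\dot q] \to S_H[q,p]$ between
the Lagrangian and the Hamiltonian formalism corresponds to a Legendre transform.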
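
The Hamilton equations are first order flow equations, so they can be integrated step by step. The following minimal sketch (an illustration added here, not the author's method) uses the leapfrog (Störmer-Verlet) scheme for a Hamiltonian of the form $H = p^2/2m + V(q)$; the pendulum potential $V(q) = 1 - \cos q$, the step size and the function names are assumptions made for the example. The scheme is chosen because it is exactly time-reversible and conserves the energy very well over long times.

```python
import math

def leapfrog(q, p, dV, dt, steps, m=1.0):
    """Integrate dq/dt = p/m, dp/dt = -V'(q) with the leapfrog scheme."""
    for _ in range(steps):
        p -= 0.5 * dt * dV(q)    # half step for the momentum
        q += dt * p / m          # full step for the position
        p -= 0.5 * dt * dV(q)    # second half step for the momentum
    return q, p

def energy(q, p, V, m=1.0):
    """H(q, p) = p^2 / (2m) + V(q)."""
    return p * p / (2.0 * m) + V(q)

def V(q):                        # pendulum potential (illustrative choice)
    return 1.0 - math.cos(q)

def dV(q):                       # its derivative V'(q)
    return math.sin(q)

q0, p0 = 1.0, 0.0
q1, p1 = leapfrog(q0, p0, dV, dt=0.01, steps=1000)
print("energy drift:", energy(q1, p1, V) - energy(q0, p0, V))

# Reversibility: flip the momentum, evolve for the same time, and compare with the start.
q2, p2 = leapfrog(q1, -p1, dV, dt=0.01, steps=1000)
print("return error:", abs(q2 - q0), abs(p2 + p0))
```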
Figure 2.2: Least action principle in phase space: The classical trajectory (full line) extremizes
the action $S_H[q,p]$. The initial and final positions are fixed. The initial and final momenta are
free. Their actual value is given by the variational principle as a function of the initial and final
positions and times.



2.1.2.b - Hamilton-Jacobi equation 

For a classical trajectory $q_{cl}(t)$ solution of the equations of motion, the action functionals $S_H$
and $S$ are equal! If we fix the initial time $t_1$ and the initial position $q_1$, this classical action can
be considered as a function of the final time $t_2 = t$ and of the final position $q(t_2) = q_2 = q$. This
function is called the Hamilton-Jacobi action, or the Hamilton function, and I denote it $S(q,t) =
S_{HJ}(q,t)$ to be explicit (the initial conditions $q(t_1) = q_1$ being implicit)

$$ S(q,t) = S_{HJ}(q,t) = S[q_{cl}] \tag{2.1.10} $$

with $q_{cl}$ the classical solution such that $q(t_2) = q$, $t_2 = t$, and where $t_1$ and $q(t_1) = q_1$ are kept fixed.

Using the equations of motion it is easy to see that the evolution with the final time $t$ of this
function $S(q,t)$ is given by the differential equation

$$ \frac{\partial S}{\partial t} = -H\!\left(q, \frac{\partial S}{\partial q}\right) \tag{2.1.11} $$

with $H$ the Hamiltonian function. This is a first order differential equation with respect to
the final time $t$. It is called the Hamilton-Jacobi equation.

From this equation one can show that (the initial conditions $(t_1, q_1)$ being fixed) the momentum
$p$ and the total energy $E$ of the particle at the final time $t$, expressed as a function of its final
position $q$ and of $t$, are

$$ E(q,t) = -\frac{\partial S}{\partial t}(q,t)\ ,\qquad p(q,t) = \frac{\partial S}{\partial q}(q,t) \tag{2.1.12} $$

These formulas extend to the case of systems with $n$ degrees of freedom and of more general
Hamiltonians. Positions and momenta are now $n$-component vectors

$$ q = \{q^i\}\ ,\qquad p = \{p_i\}\ ,\qquad i = 1,\cdots,n \tag{2.1.13} $$
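
The Hamilton-Jacobi equation can be checked on an explicit example. The sketch below is an added illustration under stated assumptions: it takes the harmonic oscillator $H = p^2/2m + m w^2 q^2/2$, for which the classical action between $(t_1, q_1)$ and $(t, q)$ is known in closed form, and evaluates $\partial S/\partial t + H(q, \partial S/\partial q)$ by central finite differences. The function names, the initial data $q_1 = 0.3$, $t_1 = 0$ and the step $h$ are arbitrary choices.

```python
import math

def S(q, t, q1=0.3, t1=0.0, m=1.0, w=1.0):
    """Classical action of the harmonic oscillator from (t1, q1) to (t, q)."""
    T = t - t1
    return (m * w / (2.0 * math.sin(w * T))
            * ((q * q + q1 * q1) * math.cos(w * T) - 2.0 * q * q1))

def H(q, p, m=1.0, w=1.0):
    """Harmonic oscillator Hamiltonian."""
    return p * p / (2.0 * m) + 0.5 * m * w * w * q * q

def hj_residual(q, t, h=1e-5):
    """dS/dt + H(q, dS/dq), with derivatives estimated by central differences."""
    dS_dt = (S(q, t + h) - S(q, t - h)) / (2.0 * h)
    dS_dq = (S(q + h, t) - S(q - h, t)) / (2.0 * h)   # this is the final momentum p
    return dS_dt + H(q, dS_dq)

print(hj_residual(0.7, 0.9))   # small: only finite-difference and rounding errors remain
```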
2.1.2.c - Symplectic manifolds



A more general situation is a general system whose phase space $\Omega$ is a manifold with an
even dimension $N = 2n$, not necessarily the Euclidean space $\mathbb{R}^N$, but with for instance a
non-trivial topology. Locally $\Omega$ is described by local coordinates $x = \{x^i,\ i = 1,\cdots,2n\}$ (warning! The
$x^i$ are coordinates in phase space, not some position coordinates in physical space).

Hamiltonian dynamics requires a symplectic structure on $\Omega$. This symplectic structure
allows one to define (or amounts to defining) the Poisson brackets. $\Omega$ is a symplectic manifold if it is
endowed with an antisymmetric 2-form $\omega$ (a degree 2 differential form) which is non-degenerate
and closed ($d\omega = 0$). This means that to each point $x \in \Omega$ is associated (in the coordinate
system $\{x^i\}$) a 2-form

$$ \omega(x) = \frac{1}{2}\,\omega_{ij}(x)\, dx^i \wedge dx^j $$

characterized by an antisymmetric $2n \times 2n$ matrix which is invertible

$$ \omega_{ij}(x) = -\omega_{ji}(x)\ ,\qquad \det(\omega) \neq 0 $$

$dx^i \wedge dx^j$ is the antisymmetric product (exterior product) of the two 1-forms $dx^i$ and $dx^j$. This
form is closed. Its exterior derivative $d\omega$ is zero

$$ d\omega(x) = \frac{1}{2} \sum_{i,j,k} \partial_i\,\omega_{jk}(x)\, dx^i \wedge dx^j \wedge dx^k = 0 $$

In terms of components this means

$$ \forall\ i_1 < i_2 < i_3\ ,\qquad \partial_{i_1}\omega_{i_2 i_3} + \partial_{i_2}\omega_{i_3 i_1} + \partial_{i_3}\omega_{i_1 i_2} = 0 $$

The fact that $\omega$ is a differential form means that under a local change of coordinates $x \to x'$ (in
phase space) the components of the form change as

$$ x \to x'\ ,\qquad \omega = \frac{1}{2}\,\omega(x)_{ij}\, dx^i \wedge dx^j = \frac{1}{2}\,\omega'(x')_{ij}\, dx'^i \wedge dx'^j $$

that is for the components

$$ \omega'(x')_{ij} = \omega(x)_{kl}\, \frac{\partial x^k}{\partial x'^i}\,\frac{\partial x^l}{\partial x'^j} $$

The Poisson brackets will be defined in the next subsection.

For the particle on a line, $n = 1$, $\Omega = \mathbb{R}^2$, $x = (q,p)$. The symplectic form is simply
$\omega = dq \wedge dp$. Its components are

$$ \omega = (\omega_{ij}) = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \tag{2.1.14} $$

In $d = n$ dimensions $\Omega = \mathbb{R}^{2n}$, $x = (q^i, p^i)$, and $\omega = \sum_i dq^i \wedge dp^i$, i.e.

$$ (\omega_{ij}) = \begin{pmatrix} 0 & 1 & & & \\ -1 & 0 & & & \\ & & \ddots & & \\ & & & 0 & 1 \\ & & & -1 & 0 \end{pmatrix} \tag{2.1.15} $$

The Darboux theorem shows that for any symplectic manifold $\Omega$ with a symplectic form $\omega$,
it is always possible to find local coordinate systems (in the neighborhood of any point) such
that the symplectic form takes the form (2.1.15) ($\omega$ is constant and is a direct sum of
antisymmetric symbols). The $(q^i, p^i)$ are local pairs of conjugate variables.

The fact that locally the symplectic form may be written under its generic constant form
means that symplectic geometry is characterized only by global invariants, not by local ones.
This is different from Riemannian geometry, where the metric tensor $g_{ij}$ cannot in general be
written in its flat form $h_{ij} = \delta_{ij}$, because of curvature, and where there are local invariants.

2.1.2.d - Observables, Poisson brackets 

The observables of the system defined by a symplectic phase space $\Omega$ may be identified
with the ("sufficiently regular") real functions on $\Omega$. The value of an observable $f$ for the
system in the state $x$ is simply $f(x)$. Of course observables may depend explicitly on the time $t$
in addition to $x$.

$$ \text{system in state } x \ \to\ \text{measured value of } f = f(x) \tag{2.1.16} $$

For two differentiable functions (observables) $f$ and $g$, the Poisson bracket $\{f,g\}_\omega$ is the
function (observable) defined by

$$ \{f,g\}_\omega(x) = \omega^{ij}(x)\,\partial_i f(x)\,\partial_j g(x) \quad\text{with}\quad \partial_i = \frac{\partial}{\partial x^i} \quad\text{and}\quad \omega^{ij}(x) = \big(\omega^{-1}(x)\big)_{ji} \tag{2.1.17} $$

the matrix elements of the inverse of the antisymmetric matrix $\omega(x)$ (with this index order,
$\omega = dq \wedge dp$ gives $\{q,p\} = 1$). When there is no ambiguity, I shall omit the subscript $\omega$.
In a canonical local coordinate system (Darboux coordinates) the Poisson bracket is

$$ \{f,g\} = \sum_i \left( \frac{\partial f}{\partial q^i}\frac{\partial g}{\partial p^i} - \frac{\partial f}{\partial p^i}\frac{\partial g}{\partial q^i} \right) \tag{2.1.18} $$

The Poisson bracket is antisymmetric

$$ \{f,g\} = -\{g,f\} \tag{2.1.19} $$

The fact that it involves first order derivatives only implies the Leibniz rule (the Poisson
bracket acts as a derivation)

$$ \{f,gh\} = \{f,g\}\,h + g\,\{f,h\} \tag{2.1.20} $$

The fact that the symplectic form is closed, $d\omega = 0$, is equivalent to the Jacobi identity

$$ \{f,\{g,h\}\} + \{g,\{h,f\}\} + \{h,\{f,g\}\} = 0 \tag{2.1.21} $$

Knowing the Poisson bracket $\{\ ,\ \}$ is equivalent to knowing the symplectic form $\omega$ since

$$ \{x^i, x^j\} = \omega^{ij}(x) \tag{2.1.22} $$
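
The three algebraic properties of the Poisson bracket (antisymmetry, Leibniz rule, Jacobi identity) can be verified exactly on polynomial observables. The sketch below is an added illustration: it represents a polynomial in the Darboux coordinates $(q,p)$ of a single degree of freedom as a dictionary mapping exponents $(a,b)$ to the coefficient of $q^a p^b$, and implements the canonical bracket (2.1.18). The representation and all function names are assumptions made for this example.

```python
from collections import defaultdict

def add(f, g, sign=1):
    """Return f + sign * g for polynomials stored as {(a, b): coeff of q^a p^b}."""
    out = defaultdict(int)
    for key, c in f.items():
        out[key] += c
    for key, c in g.items():
        out[key] += sign * c
    return {key: c for key, c in out.items() if c != 0}

def mul(f, g):
    """Product of two polynomials."""
    out = defaultdict(int)
    for (a1, b1), c1 in f.items():
        for (a2, b2), c2 in g.items():
            out[(a1 + a2, b1 + b2)] += c1 * c2
    return {key: c for key, c in out.items() if c != 0}

def d_q(f):
    """Partial derivative with respect to q."""
    return {(a - 1, b): a * c for (a, b), c in f.items() if a > 0}

def d_p(f):
    """Partial derivative with respect to p."""
    return {(a, b - 1): b * c for (a, b), c in f.items() if b > 0}

def bracket(f, g):
    """Canonical Poisson bracket {f, g} = df/dq dg/dp - df/dp dg/dq."""
    return add(mul(d_q(f), d_p(g)), mul(d_p(f), d_q(g)), sign=-1)

q = {(1, 0): 1}
p = {(0, 1): 1}
f = {(2, 1): 1, (0, 2): 3}      # q^2 p + 3 p^2
g = {(1, 3): 1, (1, 0): -1}     # q p^3 - q
h = {(3, 0): 1, (0, 1): 1}      # q^3 + p

print(bracket(q, p))                               # the constant polynomial 1
jacobi = add(add(bracket(f, bracket(g, h)), bracket(g, bracket(h, f))),
             bracket(h, bracket(f, g)))
print(jacobi)                                      # the zero polynomial, stored as {}
```

Working with integer coefficients keeps every identity exact, so no numerical tolerance is needed.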

2.1.2.e - Dynamics, Hamiltonian flows: 

In Hamiltonian mechanics, the dynamics of the system is generated by a Hamiltonian
function $H$. The Hamiltonian is a real regular (in general differentiable) function on the phase
space, $\Omega \to \mathbb{R}$. The state of the system $x(t)$ changes with time and the evolution equations for
the coordinates $x^i(t)$ in phase space (the Hamilton equations) take the general form (for a time
independent Hamiltonian)

$$ \dot x^i(t) = \frac{dx^i(t)}{dt} = \{x^i(t), H\} = \omega^{ij}(x(t))\,\partial_j H(x(t)) \tag{2.1.23} $$

This form involves the Poisson bracket and is covariant under local changes of coordinates in
phase space. The equations are flow equations of the general form

$$ \dot x^i(t) = \frac{dx^i(t)}{dt} = F^i(x(t)) \tag{2.1.24} $$

but the vector field $F^i = \omega^{ij}\partial_j H$ is special and derives from $H$. The flow, i.e. the mapping
$\phi: \Omega \times \mathbb{R} \to \Omega$, is called the Hamiltonian flow associated to $H$. The evolution functions $\phi_t(x)$
defined by

$$ x(t=0) = x \quad\Longrightarrow\quad \phi_t(x) = x(t) \tag{2.1.25} $$

form a group of transformations (as long as $H$ is independent of the time)

$$ \phi_{t_1+t_2} = \phi_{t_1} \circ \phi_{t_2} \tag{2.1.26} $$

More generally, let us consider a (time independent) observable $f$ (a function on $\Omega$). The
evolution of the value of $f$ for a dynamical state $x(t)$, $f(x,t) = f(x(t))$ where $x(t) = \phi_t(x)$, obeys
the equation

$$ \frac{\partial f(x,t)}{\partial t} = \frac{df(x(t))}{dt} = \{f,H\}(x(t)) \tag{2.1.27} $$

where the r.h.s. is the Poisson bracket of the observable $f$ and the Hamiltonian $H$. In particular
(when $H$ is independent of $t$) the energy $E(t) = H(x(t))$ is conserved

$$ \frac{dE(t)}{dt} = \frac{dH(x(t))}{dt} = 0 \tag{2.1.28} $$
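
These properties of a Hamiltonian flow are easy to check when the flow is known in closed form. The sketch below is an added illustration for $H = (p^2 + q^2)/2$, whose flow is a rotation in the $(q,p)$ plane; the observable $f = q^2 p$, the sample point and the function names are arbitrary assumptions. It checks the group law (2.1.26), the conservation of the energy, and the evolution equation (2.1.27) with a finite-difference time derivative.

```python
import math

def flow(t, x):
    """Hamiltonian flow of H = (p^2 + q^2)/2: dq/dt = p, dp/dt = -q."""
    q, p = x
    return (q * math.cos(t) + p * math.sin(t), -q * math.sin(t) + p * math.cos(t))

def H(x):
    q, p = x
    return 0.5 * (p * p + q * q)

def f(x):
    """An arbitrary observable, chosen for illustration: f = q^2 p."""
    q, p = x
    return q * q * p

def bracket_f_H(x):
    """{f, H} = df/dq dH/dp - df/dp dH/dq = 2 q p^2 - q^3, worked out by hand."""
    q, p = x
    return 2.0 * q * p * p - q ** 3

x0 = (0.4, -1.1)
a = flow(0.3 + 0.5, x0)
b = flow(0.3, flow(0.5, x0))
print("group law error:", abs(a[0] - b[0]), abs(a[1] - b[1]))
print("energy change:", H(flow(1.7, x0)) - H(x0))

t, h = 0.6, 1e-6
df_dt = (f(flow(t + h, x0)) - f(flow(t - h, x0))) / (2.0 * h)
print("df/dt - {f,H}:", df_dt - bracket_f_H(flow(t, x0)))
```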

2.1.2.f - The Liouville measure 

The symplectic form $\omega$ defines an invariant volume element $d\mu$ on the phase space $\Omega$.

$$ d\mu(x) = \omega^n = \prod_{i=1}^{2n} dx^i\ |\omega|^{1/2}\ ,\qquad |\omega| = |\det(\omega_{ij})| \tag{2.1.29} $$

This defines the so-called Liouville measure on $\Omega$. This measure is invariant under all the
Hamiltonian flows.
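
In Darboux coordinates the invariance of the Liouville measure means that the map $x \to \phi_t(x)$ has unit Jacobian determinant. A minimal numerical sketch of this statement (added here as an illustration, with assumptions flagged): the pendulum $H = p^2/2 + 1 - \cos q$ is evolved with the leapfrog scheme, which is itself an area-preserving map approximating the true flow, and the Jacobian is estimated by central finite differences. The Hamiltonian, the step sizes and the function names are choices made for the example.

```python
import math

def step_map(q, p, dt=0.01, steps=500):
    """Approximate flow of the pendulum H = p^2/2 + 1 - cos(q) (leapfrog scheme)."""
    for _ in range(steps):
        p -= 0.5 * dt * math.sin(q)
        q += dt * p
        p -= 0.5 * dt * math.sin(q)
    return q, p

def jacobian_det(q, p, h=1e-6):
    """Determinant of d(Q, P)/d(q, p), estimated by central finite differences."""
    qa, pa = step_map(q + h, p)
    qb, pb = step_map(q - h, p)
    qc, pc = step_map(q, p + h)
    qd, pd = step_map(q, p - h)
    dQ_dq, dP_dq = (qa - qb) / (2.0 * h), (pa - pb) / (2.0 * h)
    dQ_dp, dP_dp = (qc - qd) / (2.0 * h), (pc - pd) / (2.0 * h)
    return dQ_dq * dP_dp - dQ_dp * dP_dq

print(jacobian_det(1.0, 0.5))   # close to 1: phase space area is preserved
```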

2.1.2.g - Example: the classical spin 

The simplest example of a system with a non-trivial phase space is the classical spin (the
classical top with constant total angular momentum). The states of the spin are labelled by unit
3-component vectors $\vec n = (n_1, n_2, n_3)$, $|\vec n| = 1$ (the direction of the angular momentum). Thus
the phase space is the 2-dimensional unit sphere and is compact

$$ \Omega = S_2 $$

The classical precession equation

$$ \frac{d\vec n}{dt} = \vec B \times \vec n $$

can be written in Hamiltonian form. $\vec B$ is a vector in $\mathbb{R}^3$, possibly a 3-component vector field on
the sphere depending on $\vec n$.

There is a symplectic structure on $\Omega$. It is related to the natural complex structure on $S_2$ (the
Riemann sphere). The Poisson bracket of two functions $f$ and $g$ on $S_2$ is defined as

$$ \{f,g\} = (\vec\nabla f \times \vec\nabla g)\cdot \vec n $$

The gradient field $\vec\nabla f$ of a function $f$ on the sphere is a vector field tangent to the sphere, so
$\vec\nabla f \times \vec\nabla g$ is normal to the sphere, hence collinear with $\vec n$. In spherical coordinates

$$ \vec n = (\sin\theta\cos\phi,\ \sin\theta\sin\phi,\ \cos\theta) $$

the Poisson bracket is simply

$$ \{f,g\} = \frac{1}{\sin\theta}\left( \frac{\partial f}{\partial\theta}\frac{\partial g}{\partial\phi} - \frac{\partial g}{\partial\theta}\frac{\partial f}{\partial\phi} \right) $$

Admissible local Darboux coordinates $x = (x^1, x^2)$ such that $\omega = dx^1 \wedge dx^2$ must be locally
orthogonal, area preserving mappings. Examples are

- "action-angle" variables (the Lambert cylindrical equal-area projection)

$$ x = (\cos\theta,\ \phi) $$

- or plane coordinates (the Lambert azimuthal equal-area projection)

$$ x = (2\sin(\theta/2)\cos\phi,\ 2\sin(\theta/2)\sin\phi) $$

Figure 2.3: The Lambert cylindrical and azimuthal coordinates

With this Poisson bracket, the Hamiltonian which generates the precession dynamics is simply
(for constant $\vec B$)

$$ H = \vec B \cdot \vec n $$
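
As an added illustration of the classical spin dynamics, the sketch below integrates the precession equation $d\vec n/dt = \vec B \times \vec n$ (the equation of motion obtained from $H = \vec B\cdot\vec n$ with the bracket above) for a constant field, using a standard fourth-order Runge-Kutta scheme. The integrator, the field values and the function names are assumptions made for this example. The motion should stay on the unit sphere and conserve $H$.

```python
import math

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def shifted(n, k, s):
    """Return n + s * k, component by component."""
    return tuple(ni + s * ki for ni, ki in zip(n, k))

def precess(n, B, dt, steps):
    """Integrate dn/dt = B x n with a fourth-order Runge-Kutta scheme (constant B)."""
    for _ in range(steps):
        k1 = cross(B, n)
        k2 = cross(B, shifted(n, k1, 0.5 * dt))
        k3 = cross(B, shifted(n, k2, 0.5 * dt))
        k4 = cross(B, shifted(n, k3, dt))
        n = tuple(ni + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                  for ni, a, b, c, d in zip(n, k1, k2, k3, k4))
    return n

B = (0.3, -0.2, 1.0)
n0 = (0.6, 0.0, 0.8)
n1 = precess(n0, B, dt=0.01, steps=1000)
print("change of |n|:", math.sqrt(dot(n1, n1)) - 1.0)
print("change of H = B.n:", dot(B, n1) - dot(B, n0))
```

For a field along the third axis the solution is a rotation of $\vec n$ around that axis at angular velocity $|\vec B|$, which gives a simple independent check of the integrator.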



2.1.2.h - Statistical states, distribution functions, the Liouville equation 

We now consider statistical ensembles. If we have only some partial information on the
state of the system, to this information is associated a statistical (or mixed) state $\varphi$. This
statistical state is described by a probability distribution on the phase space $\Omega$, that we write

$$ d\rho_\varphi(x) = d\mu(x)\, \rho_\varphi(x) \tag{2.1.30} $$

with $d\mu(x)$ the Liouville measure and $\rho_\varphi(x)$ the probability density, a non-negative distribution
(function) such that

$$ \rho_\varphi(x) \ge 0\ ,\qquad \int_\Omega d\mu(x)\, \rho_\varphi(x) = 1 \tag{2.1.31} $$

On a given statistical state $\varphi$ the expectation for an observable $f$ (its expectation value, i.e. its
mean value if we perform a large number of measurements of $f$ on independent systems in the
same state $\varphi$) is

$$ \langle f \rangle_\varphi = \int_\Omega d\mu(x)\, \rho_\varphi(x)\, f(x) \tag{2.1.32} $$

When the system evolves according to some Hamiltonian flow $\phi_t$ generated by a
Hamiltonian $H$, the statistical state depends on time, $\varphi \to \varphi(t)$, as well as the distribution function
$\rho_\varphi \to \rho_{\varphi(t)}$. $\varphi$ being the initial state of the system at time $t = 0$, we can denote this function

$$ \rho_{\varphi(t)}(x) = \rho_\varphi(x,t) \tag{2.1.33} $$

This time dependent distribution function is given by

$$ \rho_\varphi(x(t),t) = \rho_\varphi(x)\ ,\qquad x(t) = \phi_t(x) \tag{2.1.34} $$

(using the fact that the Liouville measure is conserved by the Hamiltonian flow). Using the
evolution equation (2.1.23) for $x(t)$ one obtains the evolution equation for the distribution function
$\rho_\varphi(x,t)$, called the Liouville equation

$$ \frac{\partial}{\partial t}\rho_\varphi(x,t) = \{H, \rho_\varphi\}(x,t) \tag{2.1.35} $$

$\{H, \rho_\varphi\}$ is the Poisson bracket of the Hamiltonian $H$ and of the density function $\rho_\varphi$, considered
of course as a function of $x$ only (time is fixed in the r.h.s. of (2.1.35)).

With these notations, the expectation of the observable $f$ depends on the time $t$, and is
given by the two equivalent integrals

$$ \langle f \rangle_\varphi(t) = \int_\Omega d\mu(x)\, \rho_\varphi(x)\, f(x(t)) = \int_\Omega d\mu(x)\, \rho_\varphi(x,t)\, f(x) \tag{2.1.36} $$

Of course when the state of the system is a "pure state" ($\varphi_{\rm pure} = x_0$) the distribution function
is a Dirac measure $\rho_{\rm pure}(x) = \delta(x - x_0)$ and the Liouville equation leads to

$$ \rho_{\rm pure}(x,t) = \delta(x - x(t))\ ,\qquad x(t) = \phi_t(x_0) \tag{2.1.37} $$
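
The Liouville equation (2.1.35) can be checked numerically on a case where the flow is explicit. The sketch below is an added illustration for $H = (p^2 + q^2)/2$, whose flow is a rotation in the $(q,p)$ plane, so that (2.1.34) gives the density at time $t$ by transporting the initial density back along the flow. The Gaussian initial density, the sample point and the function names are arbitrary assumptions; derivatives are estimated by central finite differences.

```python
import math

def rho0(q, p):
    """Initial probability density: a normalized Gaussian centred at (1, 0)."""
    return math.exp(-((q - 1.0) ** 2 + p ** 2)) / math.pi

def rho(q, p, t):
    """rho(x, t) = rho0(phi_{-t}(x)) for H = (p^2 + q^2)/2, following (2.1.34)."""
    q_init = q * math.cos(t) - p * math.sin(t)
    p_init = q * math.sin(t) + p * math.cos(t)
    return rho0(q_init, p_init)

def liouville_residual(q, p, t, h=1e-5):
    """d rho/dt - {H, rho}, with {H, rho} = dH/dq drho/dp - dH/dp drho/dq."""
    d_t = (rho(q, p, t + h) - rho(q, p, t - h)) / (2.0 * h)
    d_q = (rho(q + h, p, t) - rho(q - h, p, t)) / (2.0 * h)
    d_p = (rho(q, p + h, t) - rho(q, p - h, t)) / (2.0 * h)
    return d_t - (q * d_p - p * d_q)   # dH/dq = q and dH/dp = p for this H

print(liouville_residual(0.3, -0.4, 0.8))   # small: only finite-difference errors remain
```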



2.1.2.i - Canonical transformations

The Hamiltonian flow is an example of canonical transformations. Canonical
transformations $C$ are (bijective) mappings $\Omega \to \Omega$ which preserve the symplectic structure. Denoting the
image of the point $x \in \Omega$ (by the canonical transformation $C$) by $X$

$$ x \ \xrightarrow{\ C\ }\ X = C(x) \tag{2.1.38} $$

This means simply that the symplectic form $\omega^*$ defined by

$$ \omega^*(x) = \omega(X) \tag{2.1.39} $$

is equal to the original form

$$ \omega^* = \omega \tag{2.1.40} $$

$\omega^*$ is called the pullback of the symplectic form $\omega$ by the mapping $C$ and is also denoted $C^*\omega$.
In a given coordinate system such that

$$ x = (x^i)\ ,\qquad X = (X^k) \tag{2.1.41} $$

(2.1.39) means that $\omega$ and $\omega^*$ read

$$ \omega(x) = \frac{1}{2}\,\omega_{ij}(x)\, dx^i \wedge dx^j\ ,\qquad \omega^*(x) = \frac{1}{2}\,\omega_{ij}(X)\, dX^i \wedge dX^j \tag{2.1.42} $$

so that the components of $\omega^*$ are

$$ \omega^*_{ij}(x) = \omega_{kl}(X)\, \frac{\partial X^k}{\partial x^i}\,\frac{\partial X^l}{\partial x^j} \tag{2.1.43} $$

$C$ is a canonical transformation if

$$ \omega_{ij}(x) = \omega^*_{ij}(x) \tag{2.1.44} $$

Canonical transformations are the transformations that preserve the Poisson brackets. Let
$f$ and $g$ be two observables (functions $\Omega \to \mathbb{R}$) and $F = f \circ C^{-1}$ and $G = g \circ C^{-1}$ their transforms
by the transformation $C$

$$ f(x) = F(X)\ ,\qquad g(x) = G(X) \tag{2.1.45} $$

$C$ is a canonical transformation if

$$ \{f,g\}_\omega = \{F,G\}_\omega \tag{2.1.46} $$

Taking for $f$ and $g$ the coordinate change $x^i \to X^i$ itself, canonical transformations are changes
of coordinates such that

$$ \{X^i, X^j\} = \{x^i, x^j\} \tag{2.1.47} $$

Canonical transformations are very useful tools in classical mechanics. They are the classi- 
cal analog of unitary transformations in quantum mechanics. 

In the simple example of the classical spin, the canonical transformations are simply the 
smooth area preserving diffeomorphisms of the 2 dimensional sphere. 
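
For a linear map $X = M x$ on the phase space $\mathbb{R}^2$ with $\omega = dq \wedge dp$, the condition (2.1.44) reduces to the matrix identity $M^T \omega M = \omega$. The sketch below is an added illustration with hypothetical helper names; it tests a few simple maps against this criterion.

```python
import math

OMEGA = [[0, 1], [-1, 0]]   # components of omega = dq ^ dp, as in (2.1.14)

def mat_mul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def is_canonical(M, tol=1e-12):
    """True if the linear map X = M x satisfies M^T omega M = omega."""
    W = mat_mul(transpose(M), mat_mul(OMEGA, M))
    return all(abs(W[i][j] - OMEGA[i][j]) < tol for i in range(2) for j in range(2))

angle = 0.3
rotation = [[math.cos(angle), math.sin(angle)], [-math.sin(angle), math.cos(angle)]]
squeeze = [[2.0, 0.0], [0.0, 0.5]]      # (q, p) -> (2 q, p / 2)
dilation = [[2.0, 0.0], [0.0, 2.0]]     # (q, p) -> (2 q, 2 p), multiplies areas by 4
swap = [[0.0, 1.0], [1.0, 0.0]]         # (q, p) -> (p, q), reverses the orientation

for name, M in [("rotation", rotation), ("squeeze", squeeze),
                ("dilation", dilation), ("swap", swap)]:
    print(name, is_canonical(M))
```

In two dimensions $M^T \omega M = \det(M)\,\omega$, so the criterion is simply $\det M = 1$: the map must preserve the oriented area.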



2.1.2.j - Along the Hamiltonian flow

As an application, one can treat the Hamiltonian flow $\phi_t$ as a time dependent change of
coordinates in phase space (a change of reference frame) and look at the dynamics of the system
in this new frame, which moves with the system. In these new coordinates, denoted $\bar x = \{\bar x^i\}$, if
at time $t = 0$ the system is in an initial state $\bar x = x_0$, at time $t$ it is of course still in the same state
$\bar x(t) = x_0$.

It is the observables $f$ which become time dependent. Indeed, if in the original (time
independent) coordinate system one considers a time independent observable $f$ (a function
$x \to f(x)$), in the new coordinate system one must consider the time dependent observable $\bar f$,
defined by

$$ \bar f(\bar x, t) = f(x(t)) \quad\text{with}\quad x(t) = \phi_t(\bar x)\ ,\ \text{i.e.}\ x(0) = \bar x \tag{2.1.48} $$

This time dependent observable $\bar f(\bar x,t)$ describes how the value of the observable $f$ evolves
with the time $t$, when expressed as a function of the initial state $\bar x$. Of course the time evolution
of $\bar f$ depends on the dynamics of the system, hence on the Hamiltonian $H$. The dynamics for
the observables is given by an evolution equation (similar to the Liouville equation, up to a sign)

$$ \frac{\partial \bar f}{\partial t} = -\{H, \bar f\} \quad\text{i.e.}\quad \frac{\partial \bar f(\bar x,t)}{\partial t} = \{\bar f, H\}(\bar x,t) \tag{2.1.49} $$

In this dynamical frame the Hamiltonian is still time independent, i.e. $\bar H = H$, since its
evolution equation is

$$ \frac{\partial \bar H}{\partial t} = -\{H, \bar H\} = 0 \tag{2.1.50} $$

The Poisson bracket is always the Poisson bracket for the symplectic form $\omega$, since $\omega$ is
conserved by the canonical transformations, in particular by this change of reference frame.

This change of frame corresponds to a change from a representation of the dynamics by
a flow of the states in phase space, the observables being fixed functions, to a representation
where the states do not move, but where there is a flow for the functions. This is the analog
for Hamiltonian flows of what is done in fluid dynamics: going from the Eulerian specification
(the fluid moves in a fixed coordinate system) to the Lagrangian specification (the coordinate
system moves along with the fluid). But these two representations are of course the classical analog
of the Schrödinger picture (vector states evolve, operators are fixed) and of the Heisenberg
picture (vector states are fixed, operators depend on time) in quantum mechanics.

2.1.3 The commutative algebra of observables 

Let us adopt a slightly more abstract point of view. The (real or) complex functions on phase
space $f: \Omega \to \mathbb{C}$ form a commutative algebra $\mathcal{A}$ with the standard addition and multiplication
laws.

$$ (f+g)(x) = f(x) + g(x)\ ,\qquad (fg)(x) = f(x)\,g(x) \tag{2.1.51} $$

This is more generally true if $\Omega = X$ is simply a locally compact topological space, and $\mathcal{A}$ the
algebra of continuous functions with compact support.

Statistical states (probability distributions on $X$) are then normalized positive linear forms
$\varphi$ on $\mathcal{A}$

$$ \varphi(\alpha f + \beta g) = \alpha\,\varphi(f) + \beta\,\varphi(g)\ ,\qquad \varphi(f f^*) \ge 0\ ,\qquad \varphi(1) = 1 \tag{2.1.52} $$

The sup or $L^\infty$ norm, defined as

$$ \|f\|^2 = \sup_{x \in X} |f(x)|^2 = \sup_{\varphi\ \text{states}} \varphi(|f(x)|^2) \tag{2.1.53} $$

has clearly the following properties

$$ \|f\| = \|f^*\|\ ,\qquad \|fg\| \le \|f\|\,\|g\|\ ,\qquad \|f f^*\| = \|f\|^2 \tag{2.1.54} $$

and $\mathcal{A}$ is complete under this norm. This makes the algebra $\mathcal{A}$ a so-called commutative
C*-algebra.
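
The norm properties (2.1.54) can be tested directly in the simplest possible case. The sketch below is an added illustration which assumes that $X$ is a finite set, so that a function on $X$ is just a list of complex values, the product is pointwise and the involution is complex conjugation. The sample functions and the function names are arbitrary.

```python
def sup_norm(f):
    """Sup norm of a function on a finite set X, stored as the list of its values."""
    return max(abs(v) for v in f)

def star(f):
    """The *-involution: pointwise complex conjugation."""
    return [complex(v).conjugate() for v in f]

def product(f, g):
    """Pointwise product of two functions on X."""
    return [a * b for a, b in zip(f, g)]

f = [1 + 2j, -3j, 0.5]
g = [2, 1 - 1j, -4]

print(sup_norm(f), sup_norm(star(f)))                      # equal norms
print(sup_norm(product(f, g)), sup_norm(f) * sup_norm(g))  # first value <= second value
print(sup_norm(product(f, star(f))), sup_norm(f) ** 2)     # equal: the C* condition
```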

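For any element $x \in X$, consider the subalgebra of the functions that vanish at $x$

$$ \mathcal{I}_x = \{ f \in \mathcal{A};\ f(x) = 0 \} \tag{2.1.55} $$

They are maximal ideals of $\mathcal{A}$, (left-)ideals $\mathcal{I}$ of an algebra $\mathcal{A}$ being subalgebras of $\mathcal{A}$ such that
$x \in \mathcal{I}$ and $y \in \mathcal{A}$ implies $xy \in \mathcal{I}$. It is easy to show that the set of maximal ideals of $\mathcal{A} = C(X)$
is isomorphic to $X$, and that $\mathcal{A}/\mathcal{I}_x = \mathbb{C}$, the target space.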

Now a famous theorem by Gelfand and Naimark states that the converse is true. Any
commutative C*-algebra is isomorphic to the algebra of continuous functions on some topo- 
logical (locally compact) space X! This seems a formal result (the space X and its topology 
may be quite wild, and very far from a regular smooth manifold), but it is important to keep in 
mind that a mathematical object (here a topological space) can be defined intrinsically (by its 
elements) or equivalently by the abstract properties of some set of functions from this object to 
some other object (here the commutative algebra of observables). This modern point of view 
in mathematics (basically this idea is at the basis of the category formalism) is also important 
in modern physics, as we shall see later in the quantum case. 

For Hamiltonian systems, the algebra of (differentiable) functions on $\Omega$ is equipped
with an additional product, the Poisson bracket $\{f,g\}$. The corresponding algebra, with the
three laws (addition, multiplication, Poisson bracket) is now a commutative Poisson algebra.
A Poisson algebra is a (not necessarily commutative) associative algebra with a bracket that
satisfies (2.1.19), (2.1.20) and (2.1.21).



2.1.4 "Axiomatics" 

The most general formulation for classical Hamiltonian dynamics is that of Poisson manifolds.
This is a more general formulation than that of symplectic manifolds, since it encompasses special
situations where the symplectic form is degenerate. Poisson manifolds can in general be split
(foliated) into "symplectic leaves" endowed with a well-defined induced symplectic structure.

The fact that in classical mechanics dynamics are given by Hamiltonian flows on a phase 
space which is a symplectic or Poisson manifold can be somehow justified, if one assumes that 
the possible dynamics are flows generated by some smooth vector fields, that these flows are 
generated by conserved quantities (Hamiltonians) and that these dynamics are covariant under 
change of frames generated by these flows (existence and invariance of canonical transforma- 
tions). 

However a real understanding and justification of classical Hamiltonian dynamics comes 
from quantum mechanics. Indeed, the Poisson bracket structure is the "classical limit" of the 
commutators of observables (operators) in quantum mechanics, and the canonical transforma- 
tions are the classical version of unitary transformations. 



2.2 Probabilities 

Probabilities are an important aspect of classical physics and are one of the essential com- 
ponents of quantum physics. Without going into any details and any formalism, I think it is 
important to recall the two main ways to consider and use probabilities in statistics and physics: 
the frequentist point of view and the Bayesian point of view. At the level considered here, these
are different points of view on the same mathematical formalism, and on its use. As we shall
see, in some sense quantum mechanics forces us to treat them on the same footing. There are
of course many different, more subtle and more precise mathematical as well as philosophical
points of view on probability theory. I shall not enter into any discussion about the merits and
the consistency of objective probabilities versus subjective probabilities.

Amongst many standard references on the mathematical formalism of probability, there is
the book by Kolmogorov [Kol50], and the book by Feller [Fel68]. See also the quick introduction
for and by a physicist by M. Bauer (in French) [Bau09]. References on Bayesian probabilities are
the books by de Finetti [dF74], by Jaynes [Jay03] and the article by Cox [Cox46].

2.2.1 The frequentist point of view 

The frequentist point of view is the most familiar and the most used in statistical physics, 
dynamical systems, as well as in mathematics (it is at the basis of the formulation of modern 
probability theory from the beginning of 20th century, in particular for the Kolmogorov ax- 
iomatic formulation of probabilities). Roughly speaking, probabilities represent a measure of 
ignorance on the state of a system, coming for instance from: uncertainty on its initial state, 
uncertainty on its dynamical evolution due to uncertainty on the dynamics, or high sensitivity
to the initial conditions (chaos). Then probabilities are asymptotic frequencies of events
(measurements) if we repeat observations on systems prepared by the same initial procedure. More
precisely, one has a set $\Omega$ of samples (the sample space), a $\sigma$-algebra $\mathcal{F}$ of "measurable" subsets
of the sample space $\Omega$, and a measure $P$ on $\mathcal{F}$ (equivalently a probability measure $\mu_{\mathcal{F}}$ on $\Omega$).
This probability measure is a priori given.



2.2.2 The Bayesian point of view 

The so-called Bayesian point of view is somewhat broader, and of use in statistics, game
theory, economics, but also in experimental sciences. It is also closer to the initial formulations of
probabilities (or "chance") in the 18th and 19th centuries. It has been reviewed by statisticians
like de Finetti or Jaynes (among others) in the 20th century.

Probabilities are considered as qualitative estimates for the "plausibility" of some proposi- 
tion (it can be the result of some observation), given some "state of knowledge" on a system. 
The rules that these probabilities must satisfy are constrained by some logical principles (ob- 
jectivist point of view where the degree of plausibility is constructed by a "rational agent"), or 
may correspond simply to some "degree of personal belief" of propositions (subjectivist point 
of view). 



2.2.3 Conditional probabilities 

The basic rules are the same in the different formulations. A most important concept is
that of conditional probabilities $P(A|B)$ (the probability of $A$, $B$ being given), and the Bayes
(or conditional probability) relation

$$P(A|B)=\frac{P(B|A)\,P(A)}{P(B)} \tag{2.2.1}$$

where $P(A)$ and $P(B)$ are the initial probabilities for $A$ and $B$ (the priors), and $P(A|B)$ and $P(B|A)$
the conditional probabilities.



Frequentist: In the frequentist formulation P(A\B) is the frequency of A, once we have se- 
lected the samples such that B is true. Bayes formula has the simple representation with Venn 
diagrams in the set of samples 




Figure 2.4: Venn representation of the conditional probabilities formula 



Bayesian: In the Bayesian formulation (see for instance the book by Jaynes), one may consider
all probabilities as conditional probabilities. For instance $P_C(A) = P(A|C)$, where the
proposition $C$ corresponds to the "prior knowledge" that leads to the probability assignment
$P_C(A)$ for $A$ (so $P_C$ is the probability distribution). If $AB$ means the proposition "$A$ and $B$"
($A\wedge B$ or $A\cap B$), Bayes formula follows from the "product rule"

$$P(AB|C) = P(A|BC)\,P(B|C) = P(B|AC)\,P(A|C) \tag{2.2.2}$$

whose meaning is the following: given $C$, if I already know the plausibility for $AB$ of being true
($P(AB|C)$), and the plausibility for $B$ of being true (the prior $P(B|C)$), the formula tells me how
I should modify the plausibility for $A$ of being true, if I learn that $B$ is true ($P(A|BC)$). Together
with the "sum rule"

$$P(A|C)+P(\neg A|C) = 1 \tag{2.2.3}$$

($\neg$ is the negation), these are the basic rules of probability theory in this framework.
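The product rule, the sum rule and the resulting Bayes relation can be verified on a small sample space. The sketch below is only an illustration: the choice of two fair dice as "prior knowledge" $C$, and the propositions `A` and `B`, are assumptions made for the example, and probabilities are computed as exact frequencies over the 36 equally likely samples.

```python
from fractions import Fraction

# Hypothetical sample space: the 36 equally likely outcomes of two fair dice.
# This plays the role of the prior knowledge C.
samples = [(i, j) for i in range(1, 7) for j in range(1, 7)]

def prob(event, given=lambda s: True):
    """Frequency of `event` among the samples for which `given` holds."""
    selected = [s for s in samples if given(s)]
    return Fraction(sum(1 for s in selected if event(s)), len(selected))

A = lambda s: s[0] + s[1] == 8      # proposition A: the sum is 8
B = lambda s: s[0] % 2 == 0         # proposition B: the first die is even
AB = lambda s: A(s) and B(s)        # proposition "A and B"
not_A = lambda s: not A(s)          # negation of A

# Product rule written both ways, and the Bayes relation that follows from it.
p_AB = prob(AB)
via_B = prob(A, given=B) * prob(B)
via_A = prob(B, given=A) * prob(A)
bayes = prob(B, given=A) * prob(A) / prob(B)

# Sum rule.
total = prob(A) + prob(not_A)
```

Using `Fraction` keeps all the equalities exact, so the two factorizations of $P(AB|C)$ can be compared without rounding issues.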

2.3 Quantum mechanics: "canonical" formulation 

Let us recall the so called "canonical formalism" of quantum mechanics, as it is presented 
in textbooks. This is the standard presentation when one uses the "correspondence principle" 
to quantize a classical non-relativistic system, or simple field theories. 

There is of course an enormous number of good books on quantum mechanics and quantum
field theory. Among the very first books on quantum mechanics, those of P. A. M. Dirac (1930)
[Dir30] and J. von Neumann (1932) [vN32] ([vN55] for the English translation of 1955) are still
very useful and valuable. Modern books with a contemporary view and treatment of the recent
developments are the books by Cohen-Tannoudji, Diu & Laloë [CTDL77], by M. Le Bellac
[LB11], and by Auletta, Fortunato and Parisi [AFP09].

Some standard references on quantum field theory are the books by J. Zinn-Justin [ZJ02]
and by A. Zee [Zee03] (in a very different style). References more oriented towards mathematical
physics will be given later.

Amongst the numerous references on the questions of the foundations and the interpretation
of quantum mechanics, one may look at the encyclopedic review by Auletta [Aul01], and at the
recent shorter book by F. Laloë [Lal12] (see also [Lal11, Lal01]). More later.

2.3.1 Principles 
2.3.1.a - Pure states: 

The phase space $\Omega$ of classical mechanics is replaced by the complex Hilbert space $\mathcal{H}$ of
states. Elements of $\mathcal{H}$ (vectors) are denoted $\psi$ or $|\psi\rangle$ ("kets" in Dirac notation). The scalar
product of two vectors $\psi$ and $\psi'$ in $\mathcal{H}$ is denoted $\psi^*\!\cdot\psi'$ or $\langle\psi|\psi'\rangle$. The $\psi^* = \langle\psi|$ are the "bras"
and belong to the dual $\mathcal{H}^*$ of $\mathcal{H}$. Note that in the mathematical literature the scalar product is
often written in the opposite order, $\langle\psi|\psi'\rangle = \psi'\!\cdot\psi^*$. We shall stick to the physicists' notation.

Pure quantum states are rays of the Hilbert space, i.e. one-dimensional subspaces of $\mathcal{H}$. They
correspond to unit norm vectors $|\psi\rangle$, such that $\|\psi\|^2 = \langle\psi|\psi\rangle = 1$, modulo an arbitrary
phase $|\psi\rangle \simeq e^{i\theta}|\psi\rangle$.

2.3.1.b - Observables: 

The physical observables $A$ are the self-adjoint operators on $\mathcal{H}$ (Hermitian or symmetric
operators), such that $A = A^\dagger$, where the conjugation is defined by $\langle A^\dagger\psi'|\psi\rangle = \langle\psi'|A\psi\rangle$. Note
that the conjugate $A^\dagger$ is rather denoted $A^*$ in the mathematical literature, and in some chapters
we shall use this notation, when dealing with general Hilbert spaces, not necessarily complex.

The operators on $\mathcal{H}$ form an associative, but non-commutative operator algebra. Any set
of commuting operators $\{A_i\}$ corresponds to a set of classically compatible observables, which
can be measured independently.

2.3.1.C - Measurements, Born principle: 

The outcome of the measurement of an observable $A$ on a state $\psi$ is in general not
deterministic. Quantum mechanics gives only probabilities, and in particular the expectation value
of the outcomes. This expectation value is given by the Born rule

$$\langle A\rangle_\psi = \langle\psi|A|\psi\rangle = \langle\psi|A\psi\rangle \tag{2.3.1}$$

For compatible (commuting) observables the probabilities of outcomes obey the standard rules
of probability and these measurements can be repeated and performed independently.

This implies in particular that the possible outcomes of the measurement of $A$ must belong
to the spectrum of $A$, i.e. can only be eigenvalues of $A$ (I consider the simple case where $A$ has
a discrete spectrum). Moreover the probability to get the outcome $a_i$ ($a_i$ being an eigenvalue of
$A$ and $|i\rangle$ the corresponding eigenvector) is the modulus squared of the probability amplitude
$\langle i|\psi\rangle$

$$\text{probability of outcome } A\to a_i \text{ in the state } |\psi\rangle \ =\ p_i = |\langle i|\psi\rangle|^2$$

It follows also that quantum measurements are irreversible processes. In ideal measurements,
or non-destructive measurements which can be repeated, if the outcome of $A$ was $a_i$, after the
measurement the system is found to be in the eigenstate $|i\rangle$. This is the projection postulate.

In the more general situation where the eigenspace of $A$ associated to the eigenvalue $a_i$ is a
higher dimensional subspace $V_i$, the state of the system is obtained by applying the orthogonal
projector $P_i$ onto $V_i$ to the initial state $|\psi\rangle$. Things are more subtle in the case of a continuous
spectrum and non-normalizable eigenstates.

At this stage I do not discuss what it means to "prepare a system in a given state", what
the state vector "represents", what a measurement process really is (the problem of quantum
measurement) and what the projection postulate means. We shall come back to some of these
questions in the course of these notes.
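As a concrete illustration of the Born rule, here is a minimal sketch for a two-level system. The observable (taken to be $\sigma_z$, with eigenvalues $\pm 1$) and the state $|\psi\rangle = \cos\theta\,|0\rangle + i\sin\theta\,|1\rangle$ are assumptions chosen for the example; the code checks that the sum of eigenvalues weighted by $p_i = |\langle i|\psi\rangle|^2$ agrees with $\langle\psi|A|\psi\rangle$.

```python
import math

# Illustrative observable: sigma_z = diag(+1, -1), with eigenvectors |0> and |1>.
eigenvalues = [1.0, -1.0]
basis = [[1, 0], [0, 1]]

def inner(u, v):
    """Scalar product <u|v>, antilinear in the first argument (physicists' convention)."""
    return sum(complex(a).conjugate() * b for a, b in zip(u, v))

# Hypothetical normalized state |psi> = cos(theta)|0> + i sin(theta)|1>.
theta = math.pi / 6
psi = [math.cos(theta), 1j * math.sin(theta)]

# Born probabilities p_i = |<i|psi>|^2 and the resulting expectation value.
probs = [abs(inner(e, psi)) ** 2 for e in basis]
expectation = sum(p * a for p, a in zip(probs, eigenvalues))

# Direct evaluation of <psi|A|psi> for the diagonal operator A = sigma_z.
A_psi = [eigenvalues[0] * psi[0], eigenvalues[1] * psi[1]]
direct = inner(psi, A_psi).real
```

The relative phase $i$ between the two components drops out of the probabilities, as it must for a measurement in the eigenbasis of $A$.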

2.3.1.d - Unitary dynamics 

For a closed system, the time evolution of the states is linear and it must preserve the
probabilities, hence the scalar product $\langle\cdot|\cdot\rangle$. Therefore it is given by unitary transformations $U(t)$ such
that $U^{-1} = U^\dagger$. Again, if the system is isolated the time evolutions form a multiplicative group
acting on the Hilbert space and its algebra of observables, hence it is generated by a self-adjoint
Hamiltonian operator $H$

$$U(t)=\exp\left(\frac{t}{i\hbar}H\right)$$

The evolution equations for states and observables are discussed below.

2.3.1.e - Multipartite systems: 

Assuming that it is possible to perform independent measurements on two (causally)
independent subsystems $S_1$ and $S_2$ implies (at least in the finite dimensional case) that the Hilbert
space $\mathcal{H}$ of the states of the composite system $S$ = "$S_1\cup S_2$" is the tensor product of the Hilbert
spaces $\mathcal{H}_1$ and $\mathcal{H}_2$ of the two subsystems.

$$\mathcal{H}=\mathcal{H}_1\otimes\mathcal{H}_2$$

This implies the existence for the system $S$ of generic "entangled states" between the two
subsystems

$$|\Psi\rangle = c\,|\psi\rangle_1\otimes|\phi\rangle_2 + c'\,|\psi'\rangle_1\otimes|\phi'\rangle_2$$

Entanglement is one of the most important features of quantum mechanics, and has no
counterpart in classical mechanics. It is entanglement that leads to many of the counter-intuitive
features of quantum mechanics, but it leads also to many of its interesting aspects and to some
of its greatest successes.
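For two two-level systems the difference between product states and entangled states can be made explicit. The sketch below is an illustration under stated assumptions: a state of the pair is stored as a $2\times 2$ coefficient matrix $c_{ij}$ (with $|\Psi\rangle = \sum_{ij} c_{ij}\,|i\rangle_1\otimes|j\rangle_2$), and it uses the elementary fact that such a non-zero state is a product state exactly when this matrix has rank one, i.e. when its determinant vanishes. The helper names are hypothetical.

```python
import math

def tensor(u, v):
    """Coefficients c[i][j] = u[i] * v[j] of the product state |u>_1 (x) |v>_2."""
    return [[a * b for b in v] for a in u]

def is_product(c, tol=1e-12):
    """A non-zero two-qubit state with coefficient matrix c is a product state iff det(c) = 0."""
    return abs(c[0][0] * c[1][1] - c[0][1] * c[1][0]) < tol

s = 1 / math.sqrt(2)
product_state = tensor([s, s], [1, 0])     # (|0> + |1>)/sqrt(2) (x) |0>
entangled_state = [[s, 0], [0, s]]         # (|00> + |11>)/sqrt(2)
```

The determinant criterion is specific to this $2\times 2$ case; for larger subsystems one would look at the rank of the coefficient matrix instead.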

2.3.1.f - Correspondence principle, canonical quantization

The correspondence principle has been very important in the elaboration of quantum
mechanics. Here by correspondence principle I mean that when quantizing a classical system,
often one can associate to canonically conjugate variables $(q_i, p_i)$ self-adjoint operators $(Q_i, P_i)$
that satisfy the canonical commutation relations

$$\{q_i,p_j\}=\delta_{ij}\quad\Longrightarrow\quad [Q_i,P_j]=i\hbar\,\delta_{ij} \tag{2.3.2}$$

and take as Hamiltonian the operator obtained by replacing in the classical Hamiltonian
the variables $(q_i, p_i)$ by the corresponding operators.

For instance, for the particle on a line in a potential, one takes as $(Q, P)$ the position and the
momentum and for the Hamiltonian

$$H=\frac{P^2}{2m}+V(Q) \tag{2.3.3}$$

The usual explicit representation is $\mathcal{H} = L^2(\mathbb{R})$, the states $|\psi\rangle$ correspond to the wave functions
$\psi(q)$, and the operators are represented as

$$Q=q\ ,\qquad P=\frac{\hbar}{i}\frac{\partial}{\partial q} \tag{2.3.4}$$
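One can check numerically that the representation $Q = q$, $P = (\hbar/i)\,\partial/\partial q$ on wave functions reproduces the canonical commutation relation $[Q,P] = i\hbar$. The sketch below is a rough illustration only: it assumes units with $\hbar = 1$, a Gaussian test function, and a central finite difference for the derivative, all of which are choices made for the example.

```python
import math

HBAR = 1.0  # units with hbar = 1 (an assumption of this sketch)

def psi(q):
    """Illustrative test wave function (an unnormalized Gaussian)."""
    return math.exp(-q * q / 2)

def derivative(f, q, h=1e-5):
    """Central finite-difference approximation of f'(q)."""
    return (f(q + h) - f(q - h)) / (2 * h)

def Q(f):
    """Position operator: (Q f)(q) = q f(q)."""
    return lambda q: q * f(q)

def P(f):
    """Momentum operator: (P f)(q) = -i hbar f'(q)."""
    return lambda q: -1j * HBAR * derivative(f, q)

def commutator_QP(f):
    """([Q, P] f)(q) = (Q P f)(q) - (P Q f)(q)."""
    return lambda q: Q(P(f))(q) - P(Q(f))(q)

q0 = 0.7
lhs = commutator_QP(psi)(q0)      # numerical value of ([Q, P] psi)(q0)
rhs = 1j * HBAR * psi(q0)         # expected value i hbar psi(q0)
```

The agreement is limited by the finite-difference step `h`, so the two sides should be compared with a tolerance rather than for exact equality.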



2.3.2 Representations of quantum mechanics 

The representation of states and observables as vectors and operators is invariant under 
global unitary transformations (the analog of canonical transformations in classical mechan- 
ics). These unitary transformations may depend on time. Therefore there are different rep- 
resentations of the dynamics in quantum mechanics. I recall the two main representations. 

2.3.2.a - The Schrodinger picture 

It is the simplest, and the most used in non-relativistic quantum mechanics, in canonical
quantization and to formulate the path integral. In the Schrödinger picture the states $\psi$ (the kets
$|\psi\rangle$) evolve with time and are denoted $\psi(t)$. The observables are represented by time independent
operators. The evolution is given by the Schrödinger equation

$$i\hbar\frac{d\psi}{dt}=H\psi \tag{2.3.5}$$

The expectation value of an observable $A$ measured at time $t$ for a system in the state $\psi$ is thus

$$\langle A\rangle_{\psi(t)} = \langle\psi(t)|A|\psi(t)\rangle \tag{2.3.6}$$

The evolution operator $U(t)$ is defined by

$$\psi(t=0)=\psi\ \to\ \psi(t)=U(t)\,\psi \tag{2.3.7}$$

It is given by

$$U(t)=\exp\left(\frac{t}{i\hbar}H\right) \tag{2.3.8}$$

and obeys the evolution equation

$$i\hbar\frac{d}{dt}U(t)=H\,U(t)\ ;\qquad U(0)=1 \tag{2.3.9}$$

This generalizes easily to the case where the Hamiltonian depends explicitly on the time $t$. Then

$$i\hbar\frac{d}{dt}U(t,t_0)=H(t)\,U(t,t_0)\ ;\qquad U(t_0,t_0)=1 \tag{2.3.10}$$

and

$$U(t,t_0)=T\left[\exp\left(\frac{1}{i\hbar}\int_{t_0}^{t}dt'\,H(t')\right)\right]=\sum_{k=0}^{\infty}(i\hbar)^{-k}\int_{t_0<t_1<\cdots<t_k<t}dt_1\cdots dt_k\ H(t_k)\cdots H(t_1) \tag{2.3.11}$$

where $T$ means the time ordered product (more later).

2.3.2.b - The Heisenberg picture 

This representation is the most useful in relativistic quantum field theory. It is in fact the
best formulation there, being fully consistent mathematically, since in that context the notion of
state is more subtle (in particular it depends on the reference frame). It is required for building
the relation between critical systems and Euclidean quantum field theory (statistical field theory).

In the Heisenberg representation, the states are redefined as a function of time via the
unitary transformation $U(-t)$ on $\mathcal{H}$, where $U(t)$ is the evolution operator for the Hamiltonian $H$.
They are denoted

$$|\psi;t\rangle = U(-t)\,|\psi\rangle \tag{2.3.12}$$

The unitary transformation redefines the observables $A$. They become time dependent and
are denoted $A(t)$

$$A(t) = U(-t)\,A\,U(t) \tag{2.3.13}$$

The dynamics given by the Schrödinger equation is reabsorbed by the unitary transformation.
The dynamical states are independent of time!

$$|\psi(t);t\rangle = U(-t)\,U(t)\,|\psi\rangle = |\psi\rangle \tag{2.3.14}$$

The expectation value of an observable $A$ on a state $\psi$ at time $t$ is in the Heisenberg
representation

$$\langle A(t)\rangle_\psi = \langle\psi(t);t|A(t)|\psi(t);t\rangle = \langle\psi|A(t)|\psi\rangle \tag{2.3.15}$$

The Schrödinger and Heisenberg representations are indeed equivalent, since they give the same
result for the physical observables (the expectation values)

$$\langle A\rangle_{\psi(t)} = \langle A(t)\rangle_\psi \tag{2.3.16}$$
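The equivalence of the two pictures can be checked on a two-level system. This is a minimal sketch under explicit assumptions: units with $\hbar = 1$, a Hamiltonian that is diagonal in the chosen basis (so that $U(t)$ is a diagonal matrix of phases $e^{-iE_n t/\hbar}$), the observable $A = \sigma_x$, and arbitrary illustrative values for the energies, the state and the time.

```python
import cmath

HBAR = 1.0                   # units with hbar = 1 (an assumption of this sketch)
E = [0.3, 1.1]               # eigenvalues of a diagonal Hamiltonian H (hypothetical values)
A = [[0, 1], [1, 0]]         # observable, here sigma_x
psi = [0.6, 0.8j]            # normalized initial state

def U(t):
    """Evolution operator exp(tH/(i hbar)) for diagonal H, stored as its diagonal."""
    return [cmath.exp(-1j * e * t / HBAR) for e in E]

def expval(op, v):
    """<v|op|v> for a 2x2 matrix op."""
    w = [sum(op[i][j] * v[j] for j in range(2)) for i in range(2)]
    return sum(complex(v[i]).conjugate() * w[i] for i in range(2))

t = 2.5
u, u_inv = U(t), U(-t)

# Schrodinger picture: the state evolves, the operator A is fixed.
psi_t = [u[i] * psi[i] for i in range(2)]
schrodinger = expval(A, psi_t)

# Heisenberg picture: A(t) = U(-t) A U(t), the state is fixed.
A_t = [[u_inv[i] * A[i][j] * u[j] for j in range(2)] for i in range(2)]
heisenberg = expval(A_t, psi)
```

Both numbers are real (up to rounding), as expected for the expectation value of a self-adjoint operator.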

In the Heisenberg representation the Hamiltonian $H$ remains independent of time (since it
commutes with $U(t)$)

$$H(t)=H \tag{2.3.17}$$

The time evolution of the operators is given by the evolution equation

$$i\hbar\frac{d}{dt}A(t)=[A(t),H] \tag{2.3.18}$$

This is the quantum version of the classical Liouville equation (2.1.49). Of course the Schrödinger
and the Heisenberg representations are the quantum analogs of the two "Eulerian" and
"Lagrangian" representations of classical mechanics discussed above.

For the particle in a potential the equations for $Q$ and $P$ are the quantum version of the
classical Hamilton equations of motion

$$\frac{d}{dt}Q(t)=\frac{1}{m}P(t)\ ,\qquad \frac{d}{dt}P(t)=-V'(Q(t)) \tag{2.3.19}$$
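For completeness, here is a sketch of the short computation behind these equations of motion. It assumes the Hamiltonian $H = P^2/2m + V(Q)$, the evolution equation $i\hbar\,\frac{d}{dt}A(t) = [A(t),H]$ for operators, the canonical commutation relation $[Q,P] = i\hbar$, and a potential $V$ that admits a power series expansion (so that the second line can be applied term by term).

```latex
\begin{align*}
[Q, P^2] &= [Q,P]\,P + P\,[Q,P] = 2i\hbar\,P
  &&\Longrightarrow\quad [Q,H] = \frac{i\hbar}{m}\,P \ ,\\
[P, Q^n] &= -\,i\hbar\, n\, Q^{n-1}
  &&\Longrightarrow\quad [P,H] = [P, V(Q)] = -\,i\hbar\, V'(Q) \ .
\end{align*}
```

Dividing both commutators by $i\hbar$ gives $\dot Q = P/m$ and $\dot P = -V'(Q)$.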

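For an observable $A$ which depends explicitly on time (in the Schrödinger picture), the
evolution equation becomes

$$i\hbar\frac{d}{dt}A(t)=i\hbar\frac{\partial}{\partial t}A(t)+[A(t),H] \tag{2.3.20}$$

and taking its expectation value in some state $\psi$ one obtains the Ehrenfest theorem

$$i\hbar\frac{d}{dt}\langle A\rangle(t)=i\hbar\left\langle\frac{\partial A}{\partial t}\right\rangle(t)+\langle[A,H]\rangle(t) \tag{2.3.21}$$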



2.3.3 Quantum statistics and the density matrix 
2.3.3.a - The density matrix 

As in classical physics, in general one has only partial information on the physical system
one is interested in. Its state has to be described by a concept of statistical or mixed state. But in
quantum mechanics all the information one can get on a system is provided by the expectation
values of the observables of the system. Statistics is already there! The pure quantum states
$|\psi\rangle$ are the special "mixed states" with the property that a maximal amount of information can
be extracted by appropriate sets of compatible measurements on the (ensemble of) states. The
difference with classical physics is that different maximal sets of information can be extracted
from the same state if one chooses to perform different incompatible sets of measurements.

The mathematical concept that represents a mixed state is that of density matrix. But before
discussing this, one can start by noticing that, as in classical physics, an abstract statistical state
$\omega$ is fully characterized by the ensemble of the expectation values $\langle A\rangle_\omega$ of all the observables
$A$ of the system, measured over the state $\omega$.

$$\langle A\rangle_\omega = \text{expectation value of } A \text{ measured over the state } \omega \tag{2.3.22}$$

I denote general statistical states by Greek letters (here $\omega$) and pure states by the bra-ket
notation when there is an ambiguity. The $\omega$ here should not be confused with the notation for the
symplectic form over the classical phase space of a classical system. We are dealing with
quantum systems and there is no classical phase space anymore.

From the fact that the observables may be represented as an algebra of operators over the
Hilbert space $\mathcal{H}$, it is natural to consider that statistical states $\omega$ correspond to linear forms
over the algebra of operators, hence maps $A\to\langle A\rangle_\omega$, with the properties

$$\langle aA+bB\rangle_\omega = a\,\langle A\rangle_\omega + b\,\langle B\rangle_\omega \qquad \text{linearity} \tag{2.3.23}$$

$$\langle A^\dagger\rangle_\omega = \overline{\langle A\rangle_\omega} \qquad \text{reality} \tag{2.3.24}$$

$$\langle A^\dagger A\rangle_\omega \ge 0 \ \text{ and } \ \langle 1\rangle_\omega = 1 \qquad \text{positivity and normalization} \tag{2.3.25}$$

For finite dimensional Hilbert spaces and for the most common infinite dimensional cases (for
physicists) this is equivalent to stating that to any statistical state is associated a normalized
positive self-adjoint matrix $\rho_\omega$ such that

$$\langle A\rangle_\omega = \mathrm{tr}(\rho_\omega A) \tag{2.3.26}$$

This is the density matrix or density operator. It was introduced by J. von Neumann (and L.
Landau and F. Bloch) in 1927. Equation (2.3.26) is the generalization of the Born rule for statistical states.

For pure states $|\psi\rangle$ the density operator is simply the projection operator onto the state

$$\rho_\psi = |\psi\rangle\langle\psi| \tag{2.3.27}$$

Before discussing some properties and features of the density matrix, let me just mention
that in the physics literature, the term "state" is usually reserved to pure states, while in the
mathematics literature the term "state" is used for general statistical states. The denomination
"pure state" or "extremal state" is used for vectors in the Hilbert space and the associated
projectors. There are in fact some good mathematical reasons to use this general denomination of
state.



2.3.3.b - Interpretations 

Let us consider a system whose Hilbert space is finite dimensional ($\dim(\mathcal{H}) = N$), in a state
given by a density matrix $\rho_\omega$. $\rho_\omega$ is an $N\times N$ self-adjoint positive matrix. It is diagonalizable
and its eigenvalues are $\ge 0$. If it has $1\le K\le N$ orthonormal eigenvectors labeled by $|n\rangle$
($n = 1,\cdots,K$) associated with $K$ non-zero eigenvalues $p_n$ ($n = 1,\cdots,K$) one can write

$$\rho_\omega=\sum_{n=1}^{K}p_n\,|n\rangle\langle n| \tag{2.3.28}$$

with

$$0<p_n\le 1\ ,\qquad \sum_n p_n = 1 \tag{2.3.29}$$

The expectation value of any observable $A$ in the state $\omega$ is

$$\langle A\rangle_\omega=\sum_n p_n\,\langle n|A|n\rangle \tag{2.3.30}$$

The statistical state $\omega$ can therefore be viewed as a classical statistical mixture of the $K$
orthonormal pure states $|n\rangle$, $n = 1,\cdots,K$, the probability of the system to be in the pure state $|n\rangle$ being
equal to $p_n$.

This point of view is usually sufficient if one wants to think about results of measurements
on a single instance of the system. But it should not be used to infer statements on how the
system has been prepared. One can indeed build a statistical ensemble of independently prepared
copies of the system corresponding to the state $\omega$ by picking at random, with probability $p_n$, the
system in the state $|n\rangle$. But this is not the only way to build a statistical ensemble corresponding
to $\omega$. More precisely, there are many different ways to prepare a statistical ensemble of states
for the system, by picking with some probability $p_\alpha$ copies of the system in different states
among a pre-chosen set $\{|\psi_\alpha\rangle\}$ of (a priori not necessarily orthonormal) pure states, which give
the same density matrix $\rho_\omega$.
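A simple example for a two-level system: three different preparation procedures that all yield the density matrix $\rho = \frac{1}{2}\mathbf{1}$. The ensembles below (two orthonormal bases, and three non-orthogonal real states at angles $0$, $\pi/3$, $2\pi/3$ picked with probability $1/3$ each) are illustrative choices, and the helper names are hypothetical.

```python
import math

def projector(v):
    """|v><v| as a 2x2 matrix."""
    return [[v[i] * complex(v[j]).conjugate() for j in range(2)] for i in range(2)]

def mixture(ensemble):
    """Density matrix sum_a p_a |psi_a><psi_a| for a list of (probability, state) pairs."""
    rho = [[0j, 0j], [0j, 0j]]
    for p, v in ensemble:
        proj = projector(v)
        for i in range(2):
            for j in range(2):
                rho[i][j] += p * proj[i][j]
    return rho

def close(r1, r2, tol=1e-12):
    """Entrywise comparison of two 2x2 matrices."""
    return all(abs(r1[i][j] - r2[i][j]) < tol for i in range(2) for j in range(2))

s = 1 / math.sqrt(2)
ensemble_z = [(0.5, [1, 0]), (0.5, [0, 1])]        # |0> or |1>
ensemble_x = [(0.5, [s, s]), (0.5, [s, -s])]       # |+> or |->
angles = [0.0, math.pi / 3, 2 * math.pi / 3]
ensemble_trine = [(1 / 3, [math.cos(a), math.sin(a)]) for a in angles]  # non-orthogonal states

rho_z = mixture(ensemble_z)
rho_x = mixture(ensemble_x)
rho_trine = mixture(ensemble_trine)
half_identity = [[0.5, 0], [0, 0.5]]
```

No measurement performed on the system alone can tell these three preparations apart, since all expectation values depend only on the common density matrix.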

This is not a paradox. The difference between the different preparation modes is contained
in the quantum correlations between the (copies of the) system and the devices used to do the
preparation. These quantum correlations are fully inaccessible if one performs measurements
on the system alone. The density matrix contains only the information about the statistics
of the measurements on the system alone (but it encodes the maximal information
obtainable by measurements on the system only).

Another subtle point is that an ensemble of copies of a system is described by a density
matrix $\rho$ for the single system if the different copies are really independent, i.e. if there are no
quantum correlations between different copies in the ensemble. Some apparent paradoxes arise
if there are such correlations. One must then consider the density matrix for several copies,
taken as a larger composite quantum system.

2.3.3.c - The von Neumann entropy

The "degree of disorder" or the "lack of information" contained in a mixed quantum state
$\omega$ is given by the von Neumann entropy

$$S(\omega) = -\mathrm{tr}(\rho_\omega\log\rho_\omega) = -\sum_n p_n\log p_n \tag{2.3.31}$$

It is the analog of the Boltzmann entropy for a classical statistical distribution. It also shares
some deep relation with Shannon entropy in information theory (more later).

The entropy of a pure state is minimal and zero. Conversely, the state of maximal entropy
is the statistical state where all quantum pure states are equiprobable. It is given by a density
matrix proportional to the identity, and the entropy is the logarithm of the number of accessible
different (orthogonal) pure quantum states, i.e. of the dimension of the Hilbert space (in
agreement with the famous Boltzmann formula $S = k_B\log N$).

$$\rho = \frac{1}{N}\,\mathbf{1}\ ,\qquad S = \log N\ ,\qquad N = \dim\mathcal{H} \tag{2.3.32}$$
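These two extreme cases are easy to check numerically once the density matrix is diagonalized. The sketch below only assumes that the eigenvalues $p_n$ of the density matrix are given as a list (with the usual convention $0\log 0 = 0$); the dimension $N = 4$ and the "intermediate" spectrum are illustrative values.

```python
import math

def von_neumann_entropy(eigenvalues):
    """S = -sum_n p_n log p_n over the non-zero eigenvalues p_n of the density matrix."""
    return -sum(p * math.log(p) for p in eigenvalues if p > 0)

N = 4
pure = [1.0, 0.0, 0.0, 0.0]             # spectrum of a projector |psi><psi|
maximally_mixed = [1.0 / N] * N         # spectrum of (1/N) times the identity
intermediate = [0.7, 0.2, 0.1, 0.0]     # hypothetical mixed state

S_pure = von_neumann_entropy(pure)
S_max = von_neumann_entropy(maximally_mixed)
S_mid = von_neumann_entropy(intermediate)
```

Any other spectrum in dimension $N$ gives an entropy between these two bounds, $0$ and $\log N$.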



2.3.3.d - Application: Entanglement entropy 

An important context where the density matrix plays a role is the context of open quantum
systems and multipartite quantum systems. Consider a bipartite system $S$ composed of two
distinct subsystems $A$ and $B$. The Hilbert space $\mathcal{H}_S$ of the pure states of $S$ is the tensor product
of the Hilbert spaces of the two subsystems

$$\mathcal{H}_S = \mathcal{H}_A\otimes\mathcal{H}_B \tag{2.3.33}$$

Let us assume that the total system is in a statistical state given by a density matrix $\rho_S$, but that
one is interested only in the subsystem $A$ (or $B$). In particular one can only perform measurements
on observables relative to $A$ (or $B$). Then all the information on $A$ is contained in the reduced
density matrix $\rho_A$, obtained by taking the partial trace of the density matrix for the whole
system $\rho_S$ over the (matrix indices relative to the) system $B$.

$$\rho_A = \mathrm{tr}_B\left[\rho_S\right] \tag{2.3.34}$$

This is simply the quantum analog of taking the marginal of a probability distribution $p(x,y)$
with respect to one of the random variables, $p_X(x) = \int dy\ p(x,y)$.
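The partial trace can be made explicit for two two-level systems. In this sketch a pure state of $S$ is assumed to be given by its coefficient matrix $c_{ij}$, $|\Psi\rangle = \sum_{ij}c_{ij}\,|i\rangle_A|j\rangle_B$, so that $\rho_S = |\Psi\rangle\langle\Psi|$ and the reduced matrices are $\rho_A = c\,c^\dagger$ and $\rho_B = c^{T}\bar c$. The particular states, the helper names and the closed-form $2\times 2$ eigenvalue formula are choices made for the example; the entropy is the von Neumann entropy $-\sum_n p_n\log p_n$ of each reduced matrix.

```python
import math

def reduced_A(c):
    """rho_A[i][k] = sum_j c[i][j] conj(c[k][j]): partial trace over B of |Psi><Psi|."""
    return [[sum(c[i][j] * complex(c[k][j]).conjugate() for j in range(2))
             for k in range(2)] for i in range(2)]

def reduced_B(c):
    """rho_B[j][l] = sum_i c[i][j] conj(c[i][l]): partial trace over A of |Psi><Psi|."""
    return [[sum(c[i][j] * complex(c[i][l]).conjugate() for i in range(2))
             for l in range(2)] for j in range(2)]

def eigenvalues_2x2(rho):
    """Eigenvalues of a 2x2 Hermitian matrix from its trace and determinant."""
    tr = (rho[0][0] + rho[1][1]).real
    det = (rho[0][0] * rho[1][1] - rho[0][1] * rho[1][0]).real
    disc = math.sqrt(max(tr * tr - 4 * det, 0.0))
    return [(tr + disc) / 2, (tr - disc) / 2]

def entropy(rho):
    """Von Neumann entropy of a 2x2 density matrix."""
    return -sum(p * math.log(p) for p in eigenvalues_2x2(rho) if p > 1e-15)

# Hypothetical entangled pure state sqrt(0.8)|00> + sqrt(0.2)|11>.
c_entangled = [[math.sqrt(0.8), 0.0], [0.0, math.sqrt(0.2)]]
S_A = entropy(reduced_A(c_entangled))
S_B = entropy(reduced_B(c_entangled))

# Product state |0>_A |0>_B for comparison: its reduced state stays pure.
c_product = [[1.0, 0.0], [0.0, 0.0]]
S_A_product = entropy(reduced_A(c_product))
```

For the entangled state the reduced matrix is mixed, with spectrum $\{0.8, 0.2\}$ on both sides, while for the product state the entropy of the reduced matrix vanishes.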

If the system $S$ is in a pure state but if this state is entangled between $A$ and $B$, the
reduced density matrix $\rho_A$ is that of a mixed state, and its entropy is $S_A(\rho_A) > 0$. Indeed when
considering $A$ only, the quantum correlations between $A$ and $B$ have been lost. If $S$ is in a pure
state the entropies are equal, $S_A(\rho_A) = S_B(\rho_B)$. This entropy is then called the entanglement entropy.
Let us just recall that this is precisely one of the contexts where the concept of von Neumann
entropy was introduced around 1927. More properties and features of quantum entropies will
be given later.

2.3.3.e - Gibbs states

A standard example of density matrix is provided by considering a quantum system $S$
which is (weakly) coupled to a large thermostat, so that it is at equilibrium, exchanging freely
energy (as well as other quantum correlations) with the thermostat, and at a finite temperature
$T$. Then the mixed state of the system is a Gibbs state (or in full generality a Kubo-Martin-Schwinger
or KMS state). If the spectrum of the Hamiltonian $H$ of the system is discrete, with
eigenstates $|n\rangle$, $n\in\mathbb{N}$, and eigenvalues (energy levels) $E_n$ (with $E_0 < E_1 < E_2\cdots$), the
density matrix is

$$\rho_\beta = \frac{1}{Z(\beta)}\exp(-\beta H) \tag{2.3.35}$$

with $Z(\beta)$ the partition function

$$Z(\beta) = \mathrm{tr}\left[\exp(-\beta H)\right] \tag{2.3.36}$$

and

$$\beta = \frac{1}{k_B T} \tag{2.3.37}$$

In the energy eigenstates basis the density matrix reads

$$\rho_\beta = \sum_n p_n\,|n\rangle\langle n| \tag{2.3.38}$$

with $p_n$ the standard Gibbs probability

$$p_n = \frac{1}{Z(\beta)}\exp(-\beta E_n)\ ;\qquad Z(\beta) = \sum_n \exp(-\beta E_n) \tag{2.3.39}$$

The expectation value of an observable $A$ in the thermal state at temperature $T$ is

$$\langle A\rangle_\beta = \sum_n p_n\,\langle n|A|n\rangle = \frac{\mathrm{tr}\left[A\exp(-\beta H)\right]}{\mathrm{tr}\left[\exp(-\beta H)\right]} \tag{2.3.40}$$

For infinite systems with an infinite number of degrees of freedom, such that several
equilibrium macroscopic states may coexist, the density matrix formalism is not sufficient and must
be replaced by the formalism of KMS states (Kubo-Martin-Schwinger). This will be discussed
a bit more later in connection with superselection sectors in the algebraic formalism.
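For a system with finitely many levels the Gibbs probabilities and thermal expectation values are straightforward to evaluate. The sketch below assumes units with $k_B = 1$ and an illustrative three-level spectrum; the function names are hypothetical. It also shows the two limits: at low temperature (large $\beta$) only the ground state is populated, at high temperature (small $\beta$) all levels become equiprobable.

```python
import math

def gibbs_probabilities(energies, beta):
    """p_n = exp(-beta E_n) / Z(beta), with Z(beta) = sum_n exp(-beta E_n)."""
    weights = [math.exp(-beta * e) for e in energies]
    Z = sum(weights)
    return [w / Z for w in weights]

def thermal_expectation(diagonal_elements, energies, beta):
    """<A>_beta = sum_n p_n <n|A|n>, given the diagonal elements <n|A|n>."""
    p = gibbs_probabilities(energies, beta)
    return sum(pn * a for pn, a in zip(p, diagonal_elements))

energies = [0.0, 1.0, 2.0]     # hypothetical energy levels, in units where k_B = 1
cold = gibbs_probabilities(energies, beta=50.0)    # low temperature
hot = gibbs_probabilities(energies, beta=1e-9)     # high temperature

# Mean energy <H>_beta at beta = 1 (the diagonal elements of H are the energies).
mean_energy = thermal_expectation(energies, energies, beta=1.0)
```

Only the diagonal elements $\langle n|A|n\rangle$ enter, since the Gibbs density matrix is diagonal in the energy eigenbasis.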

2.3.3.f - Imaginary time formalism 

Let us come back to the simple case of a quantum non-relativistic system, whose energy
spectrum is bounded below (and discrete to make things simple), but unbounded from above.
The evolution operator

$$U(t)=\exp\left(\frac{t}{i\hbar}H\right) \tag{2.3.41}$$

considered as a function of the time $t$, may be extended from "physical" real time $t\in\mathbb{R}$ to
a complex time variable, provided that

$$\mathrm{Im}(t)\le 0 \tag{2.3.42}$$

More precisely, $U(t)$, as an operator, belongs to the algebra $\mathcal{B}(\mathcal{H})$ of bounded operators on the
Hilbert space $\mathcal{H}$. A bounded operator $A$ on $\mathcal{H}$ is an operator whose $L^\infty$ norm, defined as

$$\|A\|^2=\sup_{\psi\in\mathcal{H}}\frac{\langle\psi|A^\dagger A|\psi\rangle}{\langle\psi|\psi\rangle} \tag{2.3.43}$$

is finite. This is clear in the simple case where




(2.3.44) 



The properties of the algebras of bounded operators and of their norm will be discussed in
more detail in the next section on the algebraic formulation of quantum mechanics.





Figure 2.5: Real time $t$ and imaginary (Euclidean) time $\tau = it$: Wick rotation (axes: real time, Euclidean time)



Consider now the case where $t$ is purely imaginary

$$t = -i\tau\ ,\quad \tau>0\ \text{real}\ ,\qquad U(-i\tau) = \exp\left(-\frac{\tau}{\hbar}H\right) \tag{2.3.45}$$

The evolution operator has the same form as the density matrix for the system in a Gibbs
state at temperature $T$: up to the normalization factor $1/Z(\beta)$, $U(-i\tau)$ is $\exp(-\beta H)$ with
$\tau = \hbar\beta = \hbar/(k_B T)$.

For relativistic quantum field theories, time becomes a "Euclidean coordinate" $\tau = x^0$, and
Minkowski space-time becomes Euclidean space. There is a deep analogy

imaginary time = finite temperature

This analogy has numerous applications. It is at the basis of many applications of quantum field
theory to statistical physics (Euclidean Field Theory). Conversely, statistical physics methods
have found applications in quantum physics and high energy physics (lattice gauge theories).
Considering quantum theory for imaginary time is also very useful in high energy physics and in
quantum gravity. Finally this relation between Gibbs (KMS) states and the unitary evolution
operator extends to a more general relation between states and automorphisms of some
operator algebras (Tomita-Takesaki theory), that we shall discuss (very superficially) in the next
chapter.



2.4 Path and functional integrals formulations 

2.4.1 Path integrals 

It is known since Feynman that a very useful, if not always rigorous, way to represent
matrix elements of the evolution operator of a quantum system (transition amplitudes, or
"propagators") is provided by path integrals (for non-relativistic systems with a few degrees
of freedom) and functional integrals (for relativistic or non-relativistic systems with continuous
degrees of freedom, i.e. fields).

Standard references on path integral methods in quantum mechanics and quantum field
theory are the original book by Feynman & Hibbs [RPF10], and the books by J. Zinn-Justin
[ZJ02, ZJ10].

For a single particle in an external potential, the probability amplitude $K$ for propagation
from $q_i$ at time $t_i$ to $q_f$ at time $t_f$

$$\langle q_f|U(t_f-t_i)|q_i\rangle = \langle q_f,t_f|q_i,t_i\rangle\ ,\qquad U(t)=\exp\left(\frac{t}{i\hbar}H\right) \tag{2.4.1}$$

(the first notation refers to the Schrödinger picture, the second one to the Heisenberg picture)
can be written as a sum over histories $q(t)$

$$\int_{q(t_i)=q_i,\ q(t_f)=q_f}\mathcal{D}[q]\ \exp\left(\frac{i}{\hbar}S[q]\right) \tag{2.4.2}$$

where $S[q]$ is the classical action.



Figure 2.6: Path integral: time discretization (vertical axis: configuration space)



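The precise derivation of this formula, as well as its proper mathematical definition, is
obtained by decomposing the evolution of the system into a large number N of evolutions during
elementary time steps $\Delta t = \epsilon = t/N$, passing through arbitrary intermediate
positions $q(t_n = n\epsilon)$, $n \in \{1, \cdots, N-1\}$, and using the superposition principle.
One then uses the explicit formula for the propagation kernel at small time for a particle of
mass m (the potential $V(q)$ may be considered as constant locally)

$$ K(q_f, \epsilon, q_i) \simeq \left( \frac{m}{2\pi i \hbar \epsilon} \right)^{1/2} \exp\left( \frac{i}{\hbar} \left( \frac{m}{2} \frac{(q_f - q_i)^2}{\epsilon} - \epsilon\, V\!\left( \frac{q_f + q_i}{2} \right) \right) \right) $$ (2.4.3)

and one then takes the continuous time limit $\epsilon \to 0$. The precise definition of the
measure over histories or paths is (from the prefactor)

$$ \mathcal{D}[q] = \left( \frac{m}{2\pi i \hbar \epsilon} \right)^{N/2} \prod_{n=1}^{N-1} dq(t_n) $$ (2.4.4)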
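As a consistency check of the time discretization, one can verify by hand that composing two short-time kernels reproduces a kernel of the same form for twice the time step. The sketch below assumes a free particle ($V = 0$, an assumption made only for this illustration) of mass $m$, and uses the Fresnel integral $\int du\, e^{i a u^2} = (i\pi/a)^{1/2}$ for $a > 0$.

```latex
\begin{align*}
\int dq\, K(q_f,\epsilon,q)\, K(q,\epsilon,q_i)
 &= \frac{m}{2\pi i\hbar\epsilon} \int dq\,
    \exp\left(\frac{i m}{2\hbar\epsilon}\left[(q_f-q)^2+(q-q_i)^2\right]\right) \\
 &= \frac{m}{2\pi i\hbar\epsilon}\,
    \exp\left(\frac{i m\,(q_f-q_i)^2}{4\hbar\epsilon}\right)
    \int du\, \exp\left(\frac{i m}{\hbar\epsilon}\,u^2\right),
    \qquad u = q-\frac{q_f+q_i}{2} \\
 &= \left(\frac{m}{2\pi i\hbar\,(2\epsilon)}\right)^{1/2}
    \exp\left(\frac{i}{\hbar}\,\frac{m}{2}\,\frac{(q_f-q_i)^2}{2\epsilon}\right)
  = K(q_f,2\epsilon,q_i)\Big|_{V=0}
\end{align*}
```

For the free particle the composition is exact at every step. With a potential, the short-time formula is only expected to hold up to corrections that vanish faster than $\epsilon$, which is why the limit $\epsilon \to 0$ is needed.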



Francois David, 2012 



Lecture notes - November 27, 2012 






The "Lagrangian" path integral has a "Hamiltonian" version (path integral in phase space)

$$ \int \mathcal{D}[q, p] \, \exp\left( \frac{i}{\hbar} \int dt \, \left( p\, \dot{q} - H(q, p) \right) \right) $$ (2.4.5)

But one must be very careful about the definition of this path integral (discretization and
continuum time limit) and about the measure in order to obtain a consistent quantum theory.

2.4.2 Field theories, functional integrals 

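Path integral representations extend to the case of relativistic quantum field theories. For
instance for the scalar field, whose classical action (giving the Klein-Gordon equation) is

$$ S[\phi] = \int dt \int d^3\vec{x} \; \frac{1}{2} \left( \left( \frac{\partial \phi}{\partial t} \right)^2 - \left( \frac{\partial \phi}{\partial \vec{x}} \right)^2 - m^2 \phi^2 \right) = \int d^4x \; \frac{1}{2} \left( \partial_\mu \phi \, \partial^\mu \phi - m^2 \phi^2 \right) $$ (2.4.6)

(the second form being written with metric signature (+,-,-,-)), a path integral involves an
integral over field configurations over space-time of the form

$$ \int \mathcal{D}[\phi] \, e^{\frac{i}{\hbar} S[\phi]} $$

and is usually called a functional integral.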

More precisely the vacuum expectation value of a time-ordered product of local field
operators $\phi$ in this quantized field theory can be expressed as a functional integral

$$ \langle \Omega | T \phi(x_1) \cdots \phi(x_N) | \Omega \rangle = \frac{1}{Z} \int \mathcal{D}[\phi] \, e^{\frac{i}{\hbar} S[\phi]} \, \phi(x_1) \cdots \phi(x_N) $$ (2.4.7)

with Z the partition function or vacuum amplitude

$$ Z = \int \mathcal{D}[\phi] \, e^{\frac{i}{\hbar} S[\phi]} $$ (2.4.8)

The factor 1/Z means that the functional integral is normalized so that the vacuum to vacuum
amplitude is

$$ \langle \Omega | \Omega \rangle = 1 $$

The path integral and functional integral formulations are invaluable tools to formulate
many quantum systems and quantum field theories, and to perform calculations. They give a
very simple and intuitive picture of the semiclassical regimes. They explain why the laws of
classical physics can be formulated via variational principles, since classical trajectories are
just the stationary phase trajectories (saddle points) dominating the sum over trajectories in
the classical limit $\hbar \to 0$. In many cases they allow one to treat and visualize quantum
interference effects when a few semiclassical trajectories dominate (for instance for trace formulas).

Functional integral methods are also very important conceptually for quantum field theory:
from the renormalization of QED to the quantization and proof of renormalizability of
non-abelian gauge theories, the treatment of topological effects and anomalies in QFT, the
formulation of the Wilsonian renormalization group, the applications of QFT methods to statistical
mechanics, etc. They thus provide a very useful way to quantize a theory, at least in the
semiclassical regime where one expects that the quantum theory is not too strongly coupled and
quantum correlations and interference effects can be kept under control.

I will not elaborate further here. When discussing the quantum formalism, one should keep
in mind that the path and functional integrals represent a very useful and powerful (if usually
not mathematically rigorous) way to visualize, manipulate and compute transition amplitudes,
i.e. matrix elements of operators. They rather represent an application of the standard
canonical formalism, allowing one to construct the Hilbert space (or part of it) and the matrix
elements of operators of a quantum theory out of a classical theory via a quick and efficient recipe.

2.5 Quantum mechanics and reversibility

2.5.1 Is quantum mechanics reversible or irreversible? 

An important property of quantum (as well as classical) physics is reversibility: the general 
formulation of the physical laws is the same under time reversal. This is often stated as: 

"There is no microscopic time arrow." 

This does not mean that the fundamental interactions (the specific physical laws that govern
our universe) are invariant under time reversal. It is known that (assuming unitarity, locality
and Lorentz invariance) they are invariant only under CPT, the product of charge conjugation,
parity and time reversal. The reversibility statement means that the dynamics, viewed forward
in time (press key ►), of any given state of a system is similar to the dynamics, viewed
backward in time (press key ◄), of some other state.

This reversibility statement is of course also different from the macroscopic irreversibility
that we experience in everyday life (expansion of the universe, second principle of
thermodynamics, quantum measurement, Parkinson's laws [Par55], etc.).

In classical mechanics reversibility is an obvious consequence of the Hamiltonian
formulation. In quantum mechanics things are more subtle. Indeed, while the evolution of a "closed
system" (with no interaction with its environment or some observer) is unitary and reversible
(and in particular possible quantum correlations between the system and its "outside" are kept
untouched), quantum measurements are irreversible processes. However it has long been known
that microscopic reversibility is not really in contradiction with this irreversibility.
See for instance the 1964 paper by Aharonov, Bergmann & Lebowitz [ABL64]. Since this will be
very important in the following lectures, especially in the presentation of the quantum logic
formalism, let us discuss it on a simple but basic example, with the usual suspects involved in
quantum measurements.



2.5.2 Reversibility of quantum probabilities 

We consider two observers, Alice and Bob. Each of them can measure a different observable
(respectively A and B) on a given quantum system S (for simplicity S can be in a finite number
of states, i.e. its Hilbert space is finite dimensional). We take these observations to be perfect
(non demolition) test measurements, i.e. yes/no measurements, represented by some self-adjoint
projectors $P_A$ and $P_B$ such that $P_A^2 = P_A$ and $P_B^2 = P_B$, but not necessarily commuting.
The eigenvalues of these operators are 1 and 0, corresponding to the two possible outcomes 1
and 0 (or TRUE and FALSE) of the measurements of the observable A and of the observable B.

Let us consider now the two following protocols. 



Protocol 1: Alice gets the system S (in a state she knows nothing about). She measures A and
if she finds TRUE, then she sends the system to Bob, who measures B. What is the plausibility^1
for Alice that Bob will find that B is TRUE? Let us call this the conditional probability for B to
be found true, A being known to be true, and denote it $P(B \mapsfrom A)$. The arrow $\mapsfrom$
denotes the causal ordering between the measurement of A (by Alice) and of B (by Bob).

1. In a Bayesian sense.

Figure 2.7: Protocol 1: Alice wants to guess what Bob will measure. This defines the conditional
probability $P(B \mapsfrom A)$.



Protocol 2: Alice gets the system S from Bob, and knows nothing else about S. Bob tells her
that he has measured B, but does not tell her the result of his measurement, nor how the
system was prepared before he performed the measurement (he may know nothing about it, he
just measured B). Then Alice measures A and (if) she finds TRUE she asks herself the following
question: what is the plausibility (for her, Alice) that Bob had found that B was TRUE?^2
Let us call this the conditional probability for B to have been found true, A being known to
be true, and denote it by $P(B \mapsto A)$. The arrow $\mapsto$ denotes the causal ordering
between the measurement of A (by Alice) and of B (by Bob).

If S were a classical system, and the measurements were classical measurements which do
not change the state of S, then the two protocols would be equivalent and the two quantities
would equal the standard conditional probability (Bayes formula)

$$ S \text{ classical system}: \quad P(B \mapsfrom A) = P(B \mapsto A) = P(B|A) = P(B \cap A)/P(A) . $$



In the quantum case, at a purely logical level, knowing only that the measurement process
may perturb the system S, $P(B \mapsfrom A)$ and $P(B \mapsto A)$ may be different. A crucial and
remarkable property of quantum mechanics is that they are still equal. Indeed in the first protocol
$P(B \mapsfrom A)$ is given by the Born rule; if Alice finds that A is TRUE and knows nothing more,
her best bet is that the state of S is given by the density matrix

$$ \rho_A = P_A / \mathrm{tr}(P_A) $$

Therefore the probability for Bob to find that B is TRUE is

$$ P(B \mapsfrom A) = \mathrm{tr}(\rho_A P_B) . $$

2. This question makes sense if, for instance, Alice has made a bet with Bob. Again, and especially
for this protocol, the probability has to be taken in a Bayesian sense.

Figure 2.8: Protocol 2: Alice wants to guess what Bob has measured. This defines the
conditional probability $P(B \mapsto A)$.



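In the second protocol the best guess for Alice is to assume that before Bob measures B the
state of the system is given by the equidistributed density matrix $\rho_1 = 1/\mathrm{tr}(1)$.
In this case the probability that Bob finds that B is TRUE, then that Alice finds that A is TRUE, is

$$ p_1 = \frac{\mathrm{tr}(P_B)}{\mathrm{tr}(1)} \times \mathrm{tr}(\rho_B P_A) = \frac{\mathrm{tr}(P_A P_B)}{\mathrm{tr}(1)} \qquad \text{with} \quad \rho_B = P_B / \mathrm{tr}(P_B) . $$

Similarly the probability that Bob finds that B is FALSE, then that Alice finds that A is TRUE, is

$$ p_2 = \frac{\mathrm{tr}(1 - P_B)}{\mathrm{tr}(1)} \times \mathrm{tr}(\rho_{\bar{B}} P_A) = \frac{\mathrm{tr}(P_A) - \mathrm{tr}(P_A P_B)}{\mathrm{tr}(1)} $$

where $\rho_{\bar{B}} = (1 - P_B)/\mathrm{tr}(1 - P_B)$. The probability that Bob had found TRUE,
given that Alice finds TRUE, is then obtained by normalizing over the two possibilities

$$ P(B \mapsto A) = \frac{p_1}{p_1 + p_2} = \mathrm{tr}(\rho_A P_B) . $$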

Therefore, even if A and B are not compatible, i.e. if $P_A$ and $P_B$ do not commute, we obtain
in both cases the standard result for quantum conditional probabilities

$$ S \text{ quantum system}: \quad P(B \mapsfrom A) = P(B \mapsto A) = \mathrm{tr}[P_A P_B] / \mathrm{tr}[P_A] $$ (2.5.1)
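The equality of the two protocols can be checked numerically on the smallest non-trivial case. The following is a minimal sketch, assuming a qubit and two rank-one projectors at a relative angle `theta`; the value of `theta` and all helper names are illustrative choices, not part of the notes.

```python
# Check of causal reversibility for two non-commuting projectors on a qubit.
# theta and the helper functions are illustrative assumptions.
import math

def matmul(x, y):
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def tr(x):
    return sum(x[i][i] for i in range(len(x)))

def scale(c, x):
    return [[c * v for v in row] for row in x]

def sub(x, y):
    return [[u - v for u, v in zip(r, s)] for r, s in zip(x, y)]

identity = [[1.0, 0.0], [0.0, 1.0]]
theta = 0.7  # arbitrary angle between the two measured directions
cos_t, sin_t = math.cos(theta), math.sin(theta)
P_A = [[1.0, 0.0], [0.0, 0.0]]                      # projector onto |0>
P_B = [[cos_t * cos_t, cos_t * sin_t],
       [cos_t * sin_t, sin_t * sin_t]]              # projector onto cos|0> + sin|1>

# Protocol 1: Alice found A true, so her best bet is rho_A = P_A / tr(P_A).
rho_A = scale(1.0 / tr(P_A), P_A)
forward = tr(matmul(rho_A, P_B))

# Protocol 2: uniform prior before Bob's measurement, then normalization.
not_P_B = sub(identity, P_B)
rho_B = scale(1.0 / tr(P_B), P_B)
rho_not_B = scale(1.0 / tr(not_P_B), not_P_B)
p1 = tr(P_B) / tr(identity) * tr(matmul(rho_B, P_A))          # B true, then A true
p2 = tr(not_P_B) / tr(identity) * tr(matmul(rho_not_B, P_A))  # B false, then A true
backward = p1 / (p1 + p2)

# The projectors do not commute, yet the two conditional probabilities agree.
commutator = sub(matmul(P_A, P_B), matmul(P_B, P_A))
print(forward, backward, commutator[0][1])
```

Both numbers come out equal to cos²(theta), while the printed commutator entry is non-zero, so the agreement is not an artifact of compatible observables.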

This reversibility property (that I denote here causal reversibility, in order not to confuse it
with time reversal invariance) is very important, as we shall see later.

Chapter 3

Algebraic quantum formalism



3.1 Introduction 

In this formulation, quantum mechanics is constructed from the classical concepts of
observables and states, assuming that observables are not commuting quantities anymore but
still form an algebra, and using the concepts of causality and reversibility. Of course such ideas
go back to the matrix mechanics of Heisenberg, but the precise formulation relies on the
mathematical theory of operator algebras, initiated by F. J. Murray and J. von Neumann at the end
of the thirties (one motivation of J. von Neumann was precisely to understand quantum
mechanics). It was developed by Segal (Segal 47), and then notably by Wightman, Haag, Kastler,
Ruelle, etc.

The standard and excellent reference on the algebraic and axiomatic approaches to quantum
field theory is the book by R. Haag, Local Quantum Physics, especially the second edition
(1996) [Haa96]. Another older reference is the book by N. N. Bogoliubov, A. A. Logunov, A. I.
Oksak and I. T. Todorov (1975, 1990) [BLOT90]. Another useful reference is the famous book by
R. F. Streater and A. S. Wightman (1964, 1989) [SA00].

Standard references on operator algebras in the mathematical literature are the books by
J. Dixmier (1981, 1982) [Dix69], Sakai (1971) [Sak71], P. de la Harpe and V. Jones (1995) [dlHJ95].
References more oriented towards the (mathematical) physics community are Bratteli and
Robinson (1979) [BR02], A. Connes (1994) [Con94] and A. Connes and M. Marcolli [CM07]. I shall
also need some results on real C*-algebras and the only good reference I am aware of is
Goodearl (1982) [Goo82].

I shall give here a very brief and crude presentation of the algebraic formulation of quantum
theory. It will stay at a very heuristic level, with no claim of precision or of mathematical rigor.
However the starting point will be a bit different from the usual presentation, and was
presented in [Dav11]. I shall start from the general concepts of observables and states, and derive
why abstract real C*-algebras are the natural framework to formulate quantum theories. Then I
shall explain which mathematical results ensure that the theory can always be represented by
algebras of operators on Hilbert spaces. Finally I shall explain why locality and separability
enforce the use of complex algebras and of complex Hilbert spaces.

3.2 The algebra of observables 

3.2.1 The mathematical principles 

A quantum system is described by its observables, its states and a causal involution acting
on the observables and enforcing constraints on the states. Let us first give the axioms and
motivate them later on.

3.2.1.a - Observables

The physical observables of the system generate a real associative unital algebra $\mathcal{A}$
(whose elements will still be denoted "observables"). $\mathcal{A}$ is a linear vector space

$$ a, b \in \mathcal{A}, \quad \lambda, \mu \in \mathbb{R} \quad \Longrightarrow \quad \lambda a + \mu b \in \mathcal{A} $$

with an associative product (distributive w.r.t. the addition)

$$ a, b, c \in \mathcal{A}: \qquad ab \in \mathcal{A} , \qquad (ab)c = a(bc) $$ (3.2.1)

and a unit

$$ 1a = a1 = a , \quad \forall a \in \mathcal{A} $$ (3.2.2)

We shall make precise later what "physical observables" are.

3.2.1.b - The *-conjugation

There is an involution * on $\mathcal{A}$ (denoted conjugation). It is an anti-automorphism whose
square is the identity. This means that

$$ (\lambda a + \mu b)^* = \lambda a^* + \mu b^* , \qquad (a^*)^* = a , \qquad (ab)^* = b^* a^* $$ (3.2.3)

3.2.1.c - States

Each state $\varphi$ associates to an observable a its expectation value $\varphi(a) \in \mathbb{R}$
in the state $\varphi$. The states satisfy

$$ \varphi(\lambda a + \mu b) = \lambda \varphi(a) + \mu \varphi(b) $$

$$ \varphi(a^*) = \varphi(a) , \qquad \varphi(1) = 1 , \qquad \varphi(a^* a) \ge 0 $$ (3.2.4)

The set of states is denoted $\mathcal{E}$. It is natural to assume that it allows one to
discriminate between observables, i.e.

$$ \forall\, a \neq b \in \mathcal{A} \ (\text{and} \neq 0), \quad \exists\, \varphi \in \mathcal{E} \ \text{such that} \ \varphi(a) \neq \varphi(b) $$ (3.2.5)

I do not discuss the concepts of time and dynamics at that stage. This will be done later.
I first discuss the relation between these "axioms" and the physical concepts of causality,
reversibility and probabilities.
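A concrete toy model may help to fix ideas before the physical discussion. The sketch below assumes the simplest non-commutative example, real 2x2 matrices with the transpose as conjugation * and a state of the form tr(rho a); the matrices `a`, `b` and the density matrix `rho` are arbitrary illustrative choices, not taken from the notes.

```python
# Toy model of the axioms: A = real 2x2 matrices, * = transpose,
# phi(a) = tr(rho a) with rho symmetric, positive and of unit trace.
# The specific numbers below are illustrative assumptions.

def matmul(x, y):
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(x):
    return [list(col) for col in zip(*x)]

def tr(x):
    return sum(x[i][i] for i in range(len(x)))

rho = [[0.75, 0.25], [0.25, 0.25]]   # trace 1, determinant 0.125 > 0

def phi(x):
    return tr(matmul(rho, x))

identity = [[1.0, 0.0], [0.0, 1.0]]
a = [[1.0, 2.0], [-3.0, 0.5]]
b = [[0.0, 1.0], [4.0, -2.0]]

# (ab)* = b* a*: the conjugation reverses the order of products.
reverses_order = transpose(matmul(a, b)) == matmul(transpose(b), transpose(a))

# The three conditions on states: phi(a*) = phi(a), phi(1) = 1, phi(a* a) >= 0.
print(reverses_order, phi(transpose(a)), phi(a), phi(identity),
      phi(matmul(transpose(a), a)))
```

Note that `a` and `b` do not commute here, so the example is genuinely non-classical even though every expectation value is a real number.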

3.2.2 Physical discussion 

3.2.2.a - Observables and causality

In quantum physics, the concept of physical observable corresponds both to an operation
on the system (measurement) and to the response of the system (result of the measurement), but
I shall not elaborate further. We already discussed why in classical physics observables form
a real commutative algebra. The removal of the commutativity assumption is the simplest
modification imaginable compatible with the uncertainty principle (Heisenberg 1925).

Keeping the mathematical structure of an associative but non-commutative algebra reflects
the assumption that there is still some concept of "causal ordering" between observables (not
necessarily physical), in a formal but loose sense. Indeed the multiplication and its associativity
mean that we can "combine" successive observables, e.g. ab ~ (b then a), in a linear process
such that ((c then b) then a) ~ (c then (b then a)). This "combination" is different from the
concept of "successive measurement".

Without commutativity the existence of an addition law is already a non-trivial fact: it means
that we can "combine" two non-compatible observations into a new one whose mean value is
always the sum of the first two mean values.

Both addition and multiplication of observables are in fact more natural in the context of
relativistic theories, via the analyticity properties of correlation functions and the short time
and short distance expansions.

3.2.2.b - The *-conjugation and reversibility

The existence of the involution * (or conjugation) is the second and very important
feature of quantum physics. It implies that although the observables do not commute, there is
no favored arrow of time (or causal ordering) in the formulation of a physical theory; in other
words, this is reversibility. To any causal description of a system in terms of a set of observables
{a, b, ...} corresponds an equivalent "anti-causal" description in terms of the conjugate
observables {a*, b*, ...}. Although there is no precise concept of time or dynamics yet, the
involution * must not be confused with the time reversal operator T (which may or may not be a
symmetry of the dynamics).

3.2.2.c - States, measurements and probabilities

The states $\varphi$ are the simple generalisation of the classical concept of statistical (or
probabilistic) states, describing our knowledge of a system through the expectation values of the
outcomes of measurements for each possible observable. At that stage we do not assume anything about
whether there are states such that all the values of the observables can be determined or not.
Thus a state can also be viewed as the characterization of all the information which can be
extracted from a system through a measurement process (this is the point of view often taken
in quantum information theory). We do not consider how states are prepared, nor how the
measurements are performed (this is the object of the subpart of quantum theory known as the
theory of quantum measurement) and just look at the consistency requirements on the outcomes
of measurements.

The "expectation value" $\varphi(a)$ of an observable a can be considered equally well as the
average of the outcomes of measurements of a over many realisations of the system in the same
state (frequentist view) or as the sum over the possible outcomes $a_i$ times the plausibility of
each outcome in a given state (Bayesian view). In fact both points of view have to be considered,
and are somehow unified, in the quantum formalism.

The linearity of the $\varphi$'s follows from (or is equivalent to) the assumption that the
observables form a linear vector space over $\mathbb{R}$.

The very important condition

$$ \varphi(a^*) = \varphi(a) $$

for any a follows from the assumption of reversibility. If this were not the case, there would be
observables which would allow one to favor one causal ordering, irrespective of the dynamics and
of the states of the system.

The positivity condition $\varphi(a^* a) \ge 0$ ensures that the states have a probabilistic
interpretation, so that in any state the expectation value of a positive observable is positive, and
that there are no negative probabilities; in other words it will ensure unitarity. It is the simplest
consistent positivity condition compatible with reversibility, and in fact the only possible one
without assuming more structure on the observables. Of course the condition $\varphi(1) = 1$ is the
normalisation condition for probabilities.

3.2.3 Physical observables and pure states 

Three important concepts follow from these principles. 

3.2.3.a - Physical (symmetric) observables: 

An observable $a \in \mathcal{A}$ is symmetric (self-adjoint, or self-conjugate) if a* = a.
Symmetric observables correspond to the physical observables, which are actually measurable.
Observables such that a* = -a are skew-symmetric (anti-symmetric or anti-conjugate). They do not
correspond to physical observables but must be included in order to have a consistent algebraic
formalism.

3.2.3.b - Pure states: 

The set of states $\mathcal{E}$ is a convex subset of the set of real linear forms on $\mathcal{A}$
(the dual of $\mathcal{A}$). Indeed if $\varphi_1$ and $\varphi_2$ are two states and $0 < x < 1$,
$\varphi = x \varphi_1 + (1 - x) \varphi_2$ is also a state. This corresponds to the fact that any
statistical mixture of two statistical mixtures is a statistical mixture. Then the extremal points
in $\mathcal{E}$, i.e. the states which cannot be written as a statistical mixture of two different
states in $\mathcal{E}$, are called the pure states. Non-pure states are called mixed states. If a
system is in a pure state one cannot get more information from this system than what we have already.

3.2.3.c - Bounded observables

We just need to impose two additional technical and natural assumptions: (i) for any
observable $a \neq 0$, there is a state $\varphi$ such that $\varphi(a^* a) > 0$; if this is not the
case, the observable a is indistinguishable from the observable 0 (which is always false);
(ii) $\sup_{\varphi \in \mathcal{E}} \varphi(a^* a) < \infty$, i.e. we restrict $\mathcal{A}$ to the
algebra of bounded observables; this will be enough to characterize the system.

3.3 The C*-algebra of observables 

The involution * and the existence of the states $\varphi \in \mathcal{E}$ on $\mathcal{A}$ strongly
constrain the structure of the algebra of observables and of its representations. Indeed this allows
one to associate to $\mathcal{A}$ a unique norm $\| \cdot \|$ with some specific properties. This norm
makes $\mathcal{A}$ a C*-algebra, and more precisely a real abstract C*-algebra. This structure
justifies the standard representation of quantum mechanics where pure states are elements of a
Hilbert space and physical observables are self-adjoint operators.

3.3.1 The norm on observables, A is a Banach algebra 

Let us consider the function $a \to \|a\|$ from $\mathcal{A} \to \mathbb{R}_+$ defined by

$$ \|a\|^2 = \sup_{\text{states } \varphi \in \mathcal{E}} \varphi(a^* a) $$ (3.3.1)

We have assumed that $\|a\| < \infty$, $\forall a \in \mathcal{A}$, and that
$\|a\| = 0 \iff a = 0$ (this is equivalent to $a \neq 0 \implies \exists\, \varphi \in \mathcal{E}$
such that $\varphi(a^* a) \neq 0$). It is easy to show that $\| \cdot \|$ is a norm on $\mathcal{A}$,
such that

$$ \|\lambda a\| = |\lambda| \, \|a\| , \qquad \|a + b\| \le \|a\| + \|b\| , \qquad \|ab\| \le \|a\| \, \|b\| $$ (3.3.2)

If $\mathcal{A}$ is not closed for this norm, we can take its completion $\overline{\mathcal{A}}$.
The algebra of observables is therefore a real Banach algebra.



Derivation: 

The first identity comes from the definition and the linearity of states.

Taking $c = x a + (1 - x) b$ and using the positivity $\varphi(c^* c) \ge 0$ for any
$x \in \mathbb{R}$ we obtain the Schwarz inequality
$\varphi(a^* b)^2 = \varphi(a^* b)\, \varphi(b^* a) \le \varphi(a^* a)\, \varphi(b^* b)$,
$\forall\, a, b \in \mathcal{A}$. This implies the second inequality.

The third inequality comes from the fact that if $\varphi \in \mathcal{E}$ and $b \in \mathcal{A}$
are such that $\varphi(b^* b) > 0$, then $\varphi_b$ defined by
$\varphi_b(a) = \varphi(b^* a b) / \varphi(b^* b)$ is also a state for $\mathcal{A}$. Then
$\|ab\|^2 = \sup_\varphi \varphi(b^* a^* a b) = \sup_\varphi \varphi_b(a^* a)\, \varphi(b^* b) \le \sup_\psi \psi(a^* a) \, \sup_\varphi \varphi(b^* b) = \|a\|^2 \|b\|^2$.

3.3.2 The observables form a real C*-algebra 

Moreover the norm satisfies the two following non-trivial properties:

$$ \|a^* a\| = \|a\|^2 = \|a^*\|^2 $$ (3.3.3)

and

$$ 1 + a^* a \ \text{is invertible} \quad \forall\, a \in \mathcal{A} $$ (3.3.4)

These two properties are equivalent to stating that $\mathcal{A}$ is a real C*-algebra.^1 For a
definition of real C*-algebras and the properties used below see the book by Goodearl [Goo82].



Derivation: 

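One has $\|a^* a\| \le \|a\| \, \|a^*\|$. The Schwarz inequality implies that
$\varphi(a^* a)^2 \le \varphi((a^* a)^2)\, \varphi(1)$, hence $\|a\|^2 \le \|a^* a\|$. Combining
the two inequalities gives $\|a\| \le \|a^*\|$, and exchanging the roles of a and a* gives the
equality. This implies (3.3.3).

To obtain (3.3.4), notice that if $1 + a^* a$ is not invertible, there is a $b \neq 0$ such that
$(1 + a^* a) b = 0$, hence $b^* b + (ab)^* (ab) = 0$. Since there is a state $\varphi$ such that
$\varphi(b^* b) \neq 0$, either $\varphi(b^* b) < 0$ or $\varphi((ab)^* (ab)) < 0$; this contradicts
the positivity of states.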
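The norm properties can be checked on a small example. The sketch below assumes the algebra of real 2x2 matrices with the transpose as *, and uses the standard fact that, for states of the form tr(rho a), the supremum of phi(a* a) over all density matrices is the largest eigenvalue of the symmetric matrix a* a. The matrices and helper names are illustrative assumptions.

```python
# Norm ||a||^2 = sup over states of phi(a* a) for real 2x2 matrices.
# For states tr(rho .) the supremum is the largest eigenvalue of a^T a
# (assumption of this sketch: all states are of this form).
import math

def matmul(x, y):
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(x):
    return [list(col) for col in zip(*x)]

def add(x, y):
    return [[u + v for u, v in zip(r, s)] for r, s in zip(x, y)]

def norm(x):
    s = matmul(transpose(x), x)          # symmetric positive 2x2 matrix
    p, q, r = s[0][0], s[0][1], s[1][1]
    largest = (p + r) / 2 + math.sqrt(((p - r) / 2) ** 2 + q * q)
    return math.sqrt(largest)

a = [[1.0, 2.0], [-3.0, 0.5]]
b = [[0.0, 1.0], [4.0, -2.0]]

triangle_ok = norm(add(a, b)) <= norm(a) + norm(b) + 1e-12
product_ok = norm(matmul(a, b)) <= norm(a) * norm(b) + 1e-12
c_star_gap = abs(norm(matmul(transpose(a), a)) - norm(a) ** 2)
star_gap = abs(norm(transpose(a)) - norm(a))
print(triangle_ok, product_ok, c_star_gap, star_gap)
```

The two gaps vanish up to rounding errors, which is the C* condition in this finite dimensional setting.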

The full consequences will be discussed in the next subsection. Before that we can already
introduce the concept of spectrum of an observable.



3.3.3 Spectrum of observables and results of measurements 

Here I discuss in a slightly more precise way the relationship between the spectrum of
observables and results of measurements. The spectrum^2 of an element $a \in \mathcal{A}$ is defined as

$$ \mathrm{Sp}_{\mathbb{C}}(a) = \{ z \in \mathbb{C} : (z - a) \ \text{not invertible in } \mathcal{A}_{\mathbb{C}}, \ \text{the complexification of } \mathcal{A} \} . $$



1. The first condition on the norm and the involution, (3.3.3), is sometimes called the C* condition.
The letter "C" in the name C*-algebra originally comes from the term "closed", the closure condition
specific to subalgebras of the algebra of bounded operators on a Hilbert space, which also defines
C*-algebras. The second condition, (3.3.4), is specific to real algebras.

2. The exact definition is slightly different for a general real Banach algebra.

The spectral radius of a is defined as

$$ r_{\mathbb{C}}(a) = \sup \left( |z| \,;\ z \in \mathrm{Sp}_{\mathbb{C}}(a) \right) $$



For a real C*-algebra it is known that the norm $\| \cdot \|$ defined by (3.3.1) is

$$ \|a\|^2 = r_{\mathbb{C}}(a^* a) $$

that the spectrum of any physical (symmetric) observable is real

$$ a = a^* \implies \mathrm{Sp}_{\mathbb{C}}(a) \subset \mathbb{R} $$

and that for any a, the product a*a is a symmetric positive element of $\mathcal{A}$, i.e. its
spectrum is real and positive

$$ \mathrm{Sp}_{\mathbb{C}}(a^* a) \subset \mathbb{R}_+ $$

Finally for any (continuous) real function $F : \mathbb{R} \to \mathbb{R}$ and any physical
observable a one can define the observable F(a). Physically, measuring F(a) amounts to
measuring a and, when we get the real number $\lambda$ as a result, returning $F(\lambda)$ as the
result of the measurement of F(a) (this is fully consistent with the algebraic definition of F(a)
since F(a) commutes with a). Then it can be shown easily that the spectrum of F(a) is the image
by F of the spectrum of a, i.e.

$$ \mathrm{Sp}_{\mathbb{C}}(F(a)) = F(\mathrm{Sp}_{\mathbb{C}}(a)) $$

In particular, assuming that the spectrum is a discrete set of points, let us choose for F the
function

$$ F(a) = \frac{1}{z 1 - a} $$

For any state $\varphi$, the expectation value of this observable in the state $\varphi$ is

$$ E_\varphi(z) = \varphi\left( \frac{1}{z 1 - a} \right) $$

and is an analytic function of z away from the points of the spectrum $\mathrm{Sp}_{\mathbb{C}}(a)$.
Assuming that the singularity at each $z_p$ is a single pole, the residue of $E_\varphi(z)$ at $z_p$
is nothing but

$$ \mathrm{Res}_{z_p} E_\varphi = \varphi\left( \delta(a - z_p 1) \right) = \text{probability to obtain } z_p \text{ when measuring } a \text{ in the state } \varphi $$ (3.3.5)

with $\delta(z)$ the Dirac distribution.

This implies that for any physical observable a, its spectrum is the set of all the possible
real numbers $z_p$ returned by a measurement of a. This is one of the most important axioms
of the standard formulation of quantum mechanics, and we see that it is a consequence of the
axioms in this formulation. Of course the probability to get a given value $z_p$ (an element of the
spectrum) depends on the state $\varphi$ of the system, and it is given by (3.3.5), which is nothing
but some kind of Born rule for the abstract definition of states.
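The residue formula can be illustrated on a two-level example. The sketch below assumes a symmetric 2x2 observable with eigenvalues 2 and -3 and the pure state phi(x) = x[0][0]; the residues of E(z) are computed by a discretized contour integral on a small circle. The matrix, the radius and the number of sample points are illustrative assumptions.

```python
# Residues of E(z) = phi((z - a)^{-1}) reproduce the Born probabilities.
# The observable a and the state phi(x) = x[0][0] are illustrative choices.
import cmath
import math

a = [[1.0, 2.0], [2.0, -2.0]]   # symmetric, eigenvalues 2 and -3

def E(z):
    # (0,0) entry of the inverse of the 2x2 matrix (z*1 - a)
    det = (z - a[0][0]) * (z - a[1][1]) - a[0][1] * a[1][0]
    return (z - a[1][1]) / det

def residue(z_p, radius=0.5, n=64):
    # (1 / (2 pi i)) times the contour integral of E on a circle around z_p,
    # evaluated with the trapezoidal rule: dz = i * w * dtheta, w = z - z_p.
    total = 0j
    for k in range(n):
        w = radius * cmath.exp(2j * math.pi * k / n)
        total += E(z_p + w) * w
    return total / n

# Born rule: the eigenvector for 2 is (2, 1)/sqrt(5), the one for -3 is
# (1, -2)/sqrt(5), so the state |0> gives probabilities 4/5 and 1/5.
print(residue(2.0).real, residue(-3.0).real)   # close to 0.8 and 0.2
```

The two residues add up to 1, as expected since E(z) behaves like 1/z at large z for a normalized state.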



3.3.4 Complex C*-algebras 

The theory of operator algebras (C*-algebras and W*-algebras) and their applications
(almost) exclusively deal with complex algebras, i.e. algebras over $\mathbb{C}$. In the case of
quantum physics we shall see a bit later why quantum (field) theories must be represented by complex
C*-algebras. I give here some definitions.

Abstract complex C*-algebras and complex states $\phi$ are defined as in 3.2.1. A complex
C*-algebra $\mathfrak{A}$ is a complex associative involutive algebra. The involution is now anti-linear

$$ (\lambda a + \mu b)^* = \bar{\lambda} a^* + \bar{\mu} b^* , \qquad \lambda, \mu \in \mathbb{C} $$

where $\bar{z}$ denotes the complex conjugate of z. $\mathfrak{A}$ has a norm $a \to \|a\|$ which
still satisfies the C* condition (3.3.3)

$$ \|a^* a\| = \|a\|^2 = \|a^*\|^2 $$ (3.3.6)

and it is closed under this norm. The condition (3.3.4) is not necessary any more (it follows from
(3.3.6) for complex algebras).



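The states are now defined as the complex linear forms $\phi$ on $\mathfrak{A}$ which satisfy

$$ \phi(a^*) = \overline{\phi(a)} , \qquad \phi(1) = 1 , \qquad \phi(a^* a) \ge 0 $$ (3.3.7)

Any complex C*-algebra $\mathfrak{A}$ can be considered as a real C*-algebra $\mathcal{A}_{\mathbb{R}}$
(by considering $i = \sqrt{-1}$ as an element i of the center of $\mathcal{A}_{\mathbb{R}}$) but the
reverse is not true in general.

However if a real algebra $\mathcal{A}_{\mathbb{R}}$ has an element (denoted i) in its center
$\mathcal{C}$ that is isomorphic to $\sqrt{-1}$, i.e. i is such that

$$ i^* = -i , \qquad i^2 = -1 , \qquad i a = a i \quad \forall\, a \in \mathcal{A}_{\mathbb{R}} $$ (3.3.8)

then the algebra $\mathcal{A}_{\mathbb{R}}$ is isomorphic to a complex algebra
$\mathcal{A}_{\mathbb{C}} = \mathfrak{A}$. One identifies $x 1 + y i$ with the complex scalar
$z = x + iy$. The conjugation * (linear on $\mathcal{A}_{\mathbb{R}}$) is now anti-linear on
$\mathcal{A}_{\mathbb{C}}$. One can associate to each $a \in \mathcal{A}_{\mathbb{R}}$ its real and
imaginary parts

$$ \mathrm{Re}(a) = \frac{a + a^*}{2} , \qquad \mathrm{Im}(a) = i\, \frac{a^* - a}{2} $$ (3.3.9)

and write in $\mathcal{A}_{\mathbb{C}}$

$$ a = \mathrm{Re}(a) + i\, \mathrm{Im}(a) $$ (3.3.10)

To any real state (and in fact any real linear form) $\varphi_{\mathbb{R}}$ on
$\mathcal{A}_{\mathbb{R}}$ one associates the complex state (the complex linear form)
$\phi_{\mathbb{C}}$ on $\mathcal{A}_{\mathbb{C}}$ defined as

$$ \phi_{\mathbb{C}}(a) = \varphi_{\mathbb{R}}(\mathrm{Re}(a)) + i\, \varphi_{\mathbb{R}}(\mathrm{Im}(a)) $$ (3.3.11)

It has the expected properties for a complex state on the complex algebra $\mathfrak{A}$.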
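The simplest instance of a real algebra carrying such a complex structure can be written down explicitly. The sketch below assumes the real algebra of 2x2 matrices of the form x*1 + y*J, with the transpose as conjugation and J a rotation by a quarter turn playing the role of the central element i; this choice of algebra and all names are illustrative assumptions.

```python
# Real algebra { x*1 + y*J } of real 2x2 matrices, * = transpose.
# J plays the role of the central element i (illustrative assumption).

def matmul(x, y):
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(x):
    return [list(col) for col in zip(*x)]

def add(x, y):
    return [[u + v for u, v in zip(r, s)] for r, s in zip(x, y)]

def scale(c, x):
    return [[c * v for v in row] for row in x]

one = [[1.0, 0.0], [0.0, 1.0]]
J = [[0.0, -1.0], [1.0, 0.0]]

def element(x, y):
    return add(scale(x, one), scale(y, J))

def re_part(a):
    # Re(a) = (a + a*) / 2
    return scale(0.5, add(a, transpose(a)))

def im_part(a):
    # Im(a) = i (a* - a) / 2, with i represented by J
    return matmul(J, scale(0.5, add(transpose(a), scale(-1.0, a))))

def to_complex(a):
    # identification of x*1 + y*J with the complex scalar x + iy
    return complex(a[0][0], a[1][0])

a = element(3.0, 4.0)
b = element(-1.0, 2.0)
print(re_part(a), im_part(a))                 # 3 times one, 4 times one
print(to_complex(matmul(a, b)), to_complex(a) * to_complex(b))
```

In this commutative example the complex algebra obtained is just the complex numbers themselves, and the matrix product reproduces the complex multiplication.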



3.4 The GNS construction, operators and Hilbert spaces 

General theorems show that abstract C*-algebras can always be represented as algebras of
operators on some Hilbert space. This is the main reason why pure states are always
represented by vectors in a Hilbert space and observables as operators. Let us briefly consider how
this works.

3.4.1 Finite dimensional algebra of observables 

Let us first consider the case of finite dimensional algebras, which corresponds to quantum
systems with a finite number of independent quantum states. This is the case considered in
general in quantum information theory.

If $\mathcal{A}$ is a finite dimensional real algebra, one can show by purely algebraic methods
that $\mathcal{A}$ is a direct sum of matrix algebras over $\mathbb{R}$, $\mathbb{C}$ or $\mathbb{H}$
(the quaternions). See [Goo82] for details. The idea is to show that the C*-algebra conditions imply
that the real algebra $\mathcal{A}$ is semi-simple (it cannot have a nilpotent two-sided ideal) and
to use the Artin-Wedderburn theorem. One can even relax the positivity condition
$\varphi(a^* a) \ge 0$ for any a to the condition $\varphi(a^2) \ge 0$ for physical
observables a = a*, which is physically somewhat more satisfactory (F. David unpublished,
probably known in the math literature...). Thus the algebra is of the form

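$$ \mathcal{A} = \bigoplus_i M_{n_i}(K_i) , \qquad K_i = \mathbb{R}, \mathbb{C}, \mathbb{H} $$ (3.4.1)

The index i labels the components of the center of the algebra. Any observable reads

$$ a = \bigoplus_i a_i , \qquad a_i \in \mathcal{A}_i = M_{n_i}(K_i) $$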

The multiplication corresponds to the standard matrix multiplication and the involution *
to the standard conjugation (transposition, transposition + complex conjugation and
transposition + quaternionic conjugation respectively for real, complex and quaternionic matrices).
One thus recovers the familiar matrix ensembles of random matrix theory.
Any state $\omega$ can be written as

$$ \omega(a) = \sum_i p_i \, \mathrm{tr}(\rho_i a_i) , \qquad p_i \ge 0 , \qquad \sum_i p_i = 1 $$

with the $\rho_i$'s some symmetric positive normalised matrices in each $\mathcal{A}_i$

$$ \rho_i \in \mathcal{A}_i = M_{n_i}(K_i) , \qquad \rho_i = \rho_i^* , \qquad \mathrm{tr}(\rho_i) = 1 , \qquad \rho_i \ge 0 $$

The algebra of observables is indeed a subalgebra of the algebra of operators on a finite
dimensional real Hilbert space $\mathcal{H} = \bigoplus_i K_i^{n_i}$ ($\mathbb{C}$ and $\mathbb{H}$
being considered as 2 dimensional and 4 dimensional real vector spaces respectively). But it is not
necessarily the whole algebra $\mathcal{L}(\mathcal{H})$. The system corresponds to a disjoint
collection of standard quantum systems described by their Hilbert space $\mathcal{H}_i = K_i^{n_i}$
and their algebra of observables $\mathcal{A}_i$. This decomposition is (with a bit of abuse of
language) a decomposition into superselection sectors.^3 The $\rho_i$ are the quantum density matrices
corresponding to the state. The $p_i$'s correspond to the classical probability to be in a given
sector, i.e. in a state described by $(\mathcal{A}_i, \mathcal{H}_i)$.

A pure state is (the projection onto) a single vector $|\psi_i\rangle$ in a single sector
$\mathcal{H}_i$. Linear superpositions of pure states in different sectors
$|\psi\rangle = \sum_i c_i |\psi_i\rangle$ do not make sense, since they do not belong to the
representation of $\mathcal{A}$. No observable a in $\mathcal{A}$ allows one to discriminate between
the seemingly pure state $|\psi\rangle\langle\psi|$ and the mixed state
$\sum_i |c_i|^2 |\psi_i\rangle\langle\psi_i|$. Thus the different sectors can be viewed as describing
completely independent systems with no quantum correlations, in other words really parallel universes
with no possible interaction or communication between them.
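The indistinguishability of a superposition across sectors from the corresponding mixture can be made explicit. The sketch below assumes an algebra with two sectors, real 2x2 matrices and real 1x1 matrices, realized as block-diagonal 3x3 matrices; the vectors, coefficients and operators are illustrative assumptions.

```python
# Two sectors: M_2(R) + M_1(R), realized as block-diagonal 3x3 real matrices.
# All numerical values below are illustrative assumptions.

def expect_pure(psi, a):
    # <psi| a |psi> for a real vector psi
    n = len(psi)
    return sum(psi[i] * a[i][j] * psi[j] for i in range(n) for j in range(n))

psi1 = [0.6, 0.8, 0.0]   # unit vector in the first sector
psi2 = [0.0, 0.0, 1.0]   # unit vector in the second sector
c1, c2 = 0.6, 0.8        # c1^2 + c2^2 = 1
psi = [c1 * u + c2 * v for u, v in zip(psi1, psi2)]

def expect_mixed(a):
    # mixture with classical probabilities c1^2 and c2^2
    return c1 ** 2 * expect_pure(psi1, a) + c2 ** 2 * expect_pure(psi2, a)

# An observable of the algebra: block diagonal.
a_block = [[1.0, 2.0, 0.0],
           [2.0, -1.0, 0.0],
           [0.0, 0.0, 5.0]]

# An operator coupling the two sectors: NOT an element of the algebra.
a_cross = [[0.0, 0.0, 1.0],
           [0.0, 0.0, 0.0],
           [1.0, 0.0, 0.0]]

print(expect_pure(psi, a_block), expect_mixed(a_block))   # equal
print(expect_pure(psi, a_cross), expect_mixed(a_cross))   # different
```

Only an operator outside the algebra, such as `a_cross`, is sensitive to the relative phase between sectors, which is the content of the statement above.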

3.4.2 Infinite dimensional real algebra of observables 

This result generalizes to the case of infinite dimensional real C*-algebras, but it is much
more difficult to prove: analysis and topology enter the game and the fact that the algebra is
closed under the norm is crucial (for a physicist this is a natural requirement).

Theorem (Ingelstam [Ing64], [Goo82]): For any real C*-algebra $\mathcal{A}$, there exists a real
Hilbert space $\mathcal{H}$ such that $\mathcal{A}$ is isomorphic to a real symmetric closed
sub-algebra of the algebra $B(\mathcal{H})$ of bounded operators on $\mathcal{H}$.

3. For many authors the term of superselection sectors is reserved to infinite dimensional algebras which do 
have inequivalent representations. 
Now any real algebra of symmetric operators on a real Hilbert space $\mathcal{H}$ may be extended (by
standard complexification) into a complex algebra of self-adjoint operators on a Hilbert space
$\mathcal{H}_{\mathbb{C}}$ over $\mathbb{C}$, and thus one can reduce the study of real algebras to the study of complex algebras.
In particular the theory of representations of real C*-algebras is not really richer than that of
complex C*-algebras, and mathematicians usually consider only the latter case.

I will discuss later why in quantum physics one should also restrict oneself to complex
algebras. But note that in physics real (and quaternionic) algebras of observables do appear as the
subalgebra of observables of some system described by a complex Hilbert space, subjected to
some additional symmetry constraint (time reversal invariance T for real algebras, time reversal
and an additional SU(2) invariance for quaternionic algebras).



3.4.3 The complex case, the GNS construction 

Let us discuss further the case of complex C*-algebras, since their representations in terms of
Hilbert spaces are simpler to deal with. The famous GNS construction (Gelfand-Naimark-Segal
[GN43, Seg47]) allows one to construct the representations of the algebra of observables in terms of
its pure states. It is interesting to see the basic ideas, since this allows one to understand how the
Hilbert space of physical pure states emerges from the abstract⁴ concepts of observables and
mixed states.

The idea is rather simple. To every state $\varphi$ we associate a representation of the algebra
$\mathcal{A}$ in a Hilbert space $\mathcal{H}_\varphi$. This is done as follows. The state $\varphi$ allows one to define a sesquilinear form
$\langle\ |\ \rangle_\varphi$ on $\mathcal{A}$, considered as a vector space over $\mathbb{C}$, through

$\langle a|b\rangle_\varphi = \varphi(a^* b)$ (3.4.2)

This form is $\ge 0$ but is not $> 0$, since there are in general isotropic (or null) vectors such that
$\langle a|a\rangle_\varphi = 0$. Thus $\mathcal{A}$ with this form is a pre-Hilbert space. However, thanks to the C*-condition,
these vectors form a linear subspace $\mathcal{I}_\varphi$ of $\mathcal{A}$.

$\mathcal{I}_\varphi = \{a \in \mathcal{A} : \langle a|a\rangle_\varphi = 0\}$ (3.4.3)

Taking the (completion of the) quotient space one obtains the vector space

$\mathcal{H}_\varphi = \mathcal{A}/\mathcal{I}_\varphi$ (3.4.4)

When there is no ambiguity, if $a$ is an element of the algebra $\mathcal{A}$ (an observable), we denote by
$|a\rangle$ the corresponding vector in the Hilbert space $\mathcal{H}_\varphi$, that is the equivalence class of $a$

$|a\rangle = \{b \in \mathcal{A} : b - a \in \mathcal{I}_\varphi\}$ (3.4.5)

On this space the scalar product $\langle a|b\rangle$ is $> 0$ (and is closed), hence $\mathcal{H}_\varphi$ is a Hilbert space.



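Now the algebra $\mathcal{A}$ acts linearly on $\mathcal{H}_\varphi$ through the representation $\pi_\varphi$ (in the space of
bounded linear operators $B(\mathcal{H}_\varphi)$ on $\mathcal{H}_\varphi$) defined as

$\pi_\varphi(a)|b\rangle = |ab\rangle$ (3.4.6)

Moreover, if we consider the vector $|\xi_\varphi\rangle = |1\rangle \in \mathcal{H}_\varphi$ (the equivalence class of the operator
identity $1 \in \mathcal{A}$), it is of norm 1 and such that

$\varphi(a) = \langle\xi_\varphi|\pi_\varphi(a)|\xi_\varphi\rangle$ (3.4.7)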

4. In the mathematical sense: they are not defined with reference to a given representation such as operators in
Hilbert space, path integrals, etc. 
(this follows basically from the definition of the representation). Moreover this vector $|\xi_\varphi\rangle$ is
cyclic: this means that the action of the operators on this vector allows one to recover the whole
Hilbert space, more precisely

$\pi_\varphi(\mathcal{A})|\xi_\varphi\rangle = \mathcal{H}_\varphi$ (3.4.8)

However this representation is in general neither faithful (different observables may be
represented by the same operator, i.e. the mapping $\pi_\varphi$ is not injective), nor irreducible ($\mathcal{H}_\varphi$ has
invariant subspaces). The most important result of the GNS construction is:



Theorem (Gelfand-Naimark 43): The representation $\pi_\varphi$ is irreducible if and only if $\varphi$ is a pure
state.

Proof: The proof is standard and may be found in [dlHJ95].
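The statement of the theorem can be illustrated in the smallest non-commutative case. The sketch below (plain Python, exact rational arithmetic) assumes the algebra of 2x2 matrices with a state of the form $\varphi(a) = \mathrm{tr}(\rho a)$ for a real density matrix $\rho$; it computes $\dim \mathcal{H}_\varphi = \dim(\mathcal{A}/\mathcal{I}_\varphi)$ as the rank of the Gram matrix of the form (3.4.2) on the matrix units. All function names and the two density matrices are illustrative assumptions.

```python
from fractions import Fraction

def unit(i, j, n):
    """Matrix unit E_ij: 1 at row i, column j, 0 elsewhere."""
    return [[Fraction(int(r == i and c == j)) for c in range(n)] for r in range(n)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    # For real entries the adjoint a^* is just the transpose.
    return [list(row) for row in zip(*a)]

def state(rho):
    """The state phi(a) = tr(rho a) defined by a density matrix rho."""
    return lambda a: sum(matmul(rho, a)[i][i] for i in range(len(rho)))

def rank(m):
    """Rank by exact Gaussian elimination over the rationals."""
    m = [list(row) for row in m]
    r = 0
    for c in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            f = m[i][c] / m[r][c]
            m[i] = [x - f * y for x, y in zip(m[i], m[r])]
        r += 1
    return r

def gns_dimension(rho):
    """dim(A / I_phi): rank of the Gram matrix <E_ij|E_kl> = phi(E_ij^* E_kl)."""
    n = len(rho)
    phi = state(rho)
    basis = [unit(i, j, n) for i in range(n) for j in range(n)]
    gram = [[phi(matmul(transpose(a), b)) for b in basis] for a in basis]
    return rank(gram)

pure = [[Fraction(1), Fraction(0)], [Fraction(0), Fraction(0)]]
mixed = [[Fraction(1, 2), Fraction(0)], [Fraction(0), Fraction(1, 2)]]
print(gns_dimension(pure), gns_dimension(mixed))
```

For the pure state the GNS space has dimension 2, the defining (irreducible) representation of the 2x2 matrices; for the maximally mixed state there are no null vectors, the GNS space has dimension 4, and the representation is reducible (two copies of the irreducible one).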



This theorem has far reaching consequences. First it implies that the algebra of observables
$\mathcal{A}$ always has a faithful representation in some big Hilbert space $\mathcal{H}$. Any irreducible
representation $\pi$ of $\mathcal{A}$ in some Hilbert space $\mathcal{H}$ is unitarily equivalent to the GNS representation $\pi_\varphi$
constructed from a unit vector $|\xi\rangle \in \mathcal{H}$ by considering the state

$\varphi(a) = \langle\xi|\pi(a)|\xi\rangle$



Equivalent pure states: Two pure states $\varphi$ and $\psi$ are equivalent if their GNS representations $\pi_\varphi$
and $\pi_\psi$ are equivalent. Then $\varphi$ and $\psi$ are unitarily equivalent, i.e. there is a unitary element
$u$ of $\mathcal{A}$ ($u^* u = 1$) such that $\psi(a) = \varphi(u^* a u)$ for any $a$. As a consequence, to this pure state $\psi$
(which is unitarily equivalent to $\varphi$) is associated a unit vector $|\psi\rangle = \pi_\varphi(u)|\xi_\varphi\rangle$ in the Hilbert
space $\mathcal{H} = \mathcal{H}_\varphi$, and we have the representation

$\psi(a) = \langle\psi|A|\psi\rangle$ , $A = \pi_\varphi(a)$ (3.4.9)



In other words, all pure states which are equivalent can be considered as projection
operators $|\psi\rangle\langle\psi|$ on some vector $|\psi\rangle$ in the same Hilbert space $\mathcal{H}$. Any observable $a$ is represented
by some bounded operator $A$ and the expectation value of this observable in the state $\psi$ is
given by the Born formula (3.4.9). Equivalence classes of equivalent pure states are in one to one
correspondence with the irreducible representations of the algebra of observables $\mathcal{A}$.

The standard formulation of quantum mechanics in terms of operators and state vectors is
thus recovered!



3.5 Why complex algebras? 

In the mathematical presentation of the formalism that I give here, real algebras play the
essential role. However it is known that quantum physics is described by complex algebras.
There are several arguments (besides the fact that it actually works) that point towards the
necessity of complex algebras. Indeed one must take into account some essential physical features
of the quantum world: time, dynamics and locality.
3.5.1 Dynamics:

Firstly, if one wants the quantum system to have a "classical limit" corresponding to a
classical Hamiltonian system, one would like to have conjugate observables $P_i$, $Q_i$ whose classical
limits are conjugate coordinates $p_i$, $q_i$, with a correspondence between the quantum
commutators and the classical Poisson brackets

$[Q, P] \to i\,\{p, q\}$ (3.5.1)

Thus anti-symmetric operators must be in one to one correspondence with symmetric ones.
This is possible only if the algebra of operators is a complex one, i.e. if it contains an element $i$
in its center.

Another (but related) argument is that if one wants a time evolution group of inner
automorphisms acting on the operators (and the states), it is given by unitary evolution operators
$U(t)$ of the form

$U(t) = \exp(tA)$ , $A = -A^*$ (3.5.2)

This corresponds to a Hamiltonian dynamics with a physical observable corresponding to a
conserved energy (and given by a Schrödinger equation) only if the algebra is complex, so that
we can write

$A = -iH$ (3.5.3)
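The role of a central element $i$ can be made concrete in the smallest example. The sketch below (plain Python; the matrices `J` and `S` and the helper names are illustrative choices, not part of the original argument) takes the real algebra of 2x2 real matrices. The rotation generator `J` squares to -1, but it does not commute with a symmetric observable, so it is not in the center. The dimension count shows that symmetric and antisymmetric real matrices cannot be put in one to one correspondence, whereas Hermitian and anti-Hermitian complex matrices can.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

# Candidate "imaginary unit" inside the real algebra M_2(R): the generator of rotations.
J = [[0, -1], [1, 0]]
# A symmetric (physical) observable.
S = [[1, 0], [0, -1]]

j_squares_to_minus_one = matmul(J, J) == [[-1, 0], [0, -1]]
j_is_central = matmul(J, S) == matmul(S, J)

def real_dims(n):
    """(dim of symmetric, dim of antisymmetric) real n x n matrices."""
    return n * (n + 1) // 2, n * (n - 1) // 2

def complex_dims(n):
    """(real dim of Hermitian, real dim of anti-Hermitian) complex n x n matrices."""
    return n * n, n * n

print(j_squares_to_minus_one, j_is_central)
print(real_dims(2), complex_dims(2))
```

In the complex case multiplication by $i$ is the bijection $H \mapsto -iH$ of (3.5.3); in the real case the two spaces do not even have the same dimension.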



There have been various attempts to construct realistic quantum theories of particles or fields
based on strictly real Hilbert spaces, most notably by Stueckelberg and his collaborators in the
'60s; see [Stu60]. None of them is really satisfying.

3.5.2 Locality and separability: 

Another problem with real algebras comes from the requirement of locality in quantum
field theory, and from the related concept of separability of subsystems. Locality will be discussed
a bit more later on. But there is already a problem with real algebras when one wants to
characterize the properties of a composite system out of those of its subconstituents. As far as I know,
this was first pointed out by Araki, and rediscovered by various people, for instance by Wootters []
(see Auletta [], page 174, section 10.1.3).

Let us consider a system $S$ which consists of two separated subsystems $S_1$ and $S_2$. Note
that in QFT a subsystem is defined by its subalgebra of observables and of states. These are for
instance the "systems" generated by the observables in two causally separated regions. Then
the algebra of observables $\mathcal{A}$ for the total system 1 + 2 is the tensor product of the two algebras
$\mathcal{A}_1$ and $\mathcal{A}_2$

$\mathcal{A} = \mathcal{A}_1 \otimes \mathcal{A}_2$ (3.5.4)

which means that $\mathcal{A}$ is generated by the linear combinations of the elements $a$ of the form
$a_1 \otimes a_2$.

Let us now assume that the algebras of observables $\mathcal{A}_1$ and $\mathcal{A}_2$ are (sub)algebras of the
algebra of operators on some real Hilbert spaces $\mathcal{H}_1$ and $\mathcal{H}_2$. The Hilbert space of the whole
system is the tensor product $\mathcal{H} = \mathcal{H}_1 \otimes \mathcal{H}_2$. Observables are represented by operators $A$, and
physical (symmetric) operators $a = a^*$ correspond to symmetric operators $A = A^T$. Now it is
easy to see that the physical (symmetric) observables of the whole system are generated by the
products of pairs of observables $(A_1, A_2)$ of the two subsystems which are of the form

$A_1 \otimes A_2$ such that $\begin{cases} A_1 \text{ and } A_2 \text{ are both symmetric, or} \\ A_1 \text{ and } A_2 \text{ are both skew-symmetric} \end{cases}$ (3.5.5)
In both cases the product is symmetric, but these two cases do not generate the same
observables. This is different from the case of algebras of operators on complex Hilbert spaces, where
all symmetric operators on $\mathcal{H} = \mathcal{H}_1 \otimes \mathcal{H}_2$ are generated by the tensor products of the form

$A_1 \otimes A_2$ such that $A_1$ and $A_2$ are symmetric (3.5.6)

In other words, if a quantum system is composed of two independent subsystems, and the
physics is described by a real Hilbert space, there are physical observables of the big system 
which cannot be constructed out of the physical observables of the two subsystems! This would 
turn into a problem with locality, since one could not characterize the full quantum state of 
a composite system by combining the results of separate independent measurements on its 
subparts. Note that this is also related to the idea of quantum tomography. 
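A simple dimension count makes the mismatch explicit. The sketch below (plain Python; the function names are hypothetical) compares, for finite dimensional factors, the dimension of the space of symmetric operators on the composite space with the dimension spanned by products of symmetric operators of the parts, in the real and in the complex case.

```python
def real_counts(n1, n2):
    """Real Hilbert spaces R^n1 and R^n2.

    Returns (dim of symmetric operators on the tensor product,
             dim spanned by symmetric (x) symmetric,
             dim spanned by skew-symmetric (x) skew-symmetric).
    """
    sym = lambda n: n * (n + 1) // 2
    skew = lambda n: n * (n - 1) // 2
    return sym(n1 * n2), sym(n1) * sym(n2), skew(n1) * skew(n2)

def complex_counts(n1, n2):
    """Complex Hilbert spaces C^n1 and C^n2.

    Returns (real dim of Hermitian operators on the tensor product,
             real dim spanned by Hermitian (x) Hermitian).
    """
    herm = lambda n: n * n
    return herm(n1 * n2), herm(n1) * herm(n2)

print(real_counts(2, 2))     # two real two-level systems
print(complex_counts(2, 2))  # two qubits
```

For two real two-level systems the composite has 10 independent symmetric observables, of which only 9 are generated by products of symmetric observables of the parts; the missing one is the product of the two skew-symmetric generators. For two qubits the counts agree (16 and 16), which is the local tomography property alluded to above.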

3.5.3 Quaternionic Hilbert spaces: 

There have also been serious attempts to build quantum theories (in particular of fields)
based on quaternionic Hilbert spaces, both in the '60s and more recently by S. Adler [Adl95].
One idea was that the SU(2) symmetry associated to quaternions could be related to the
symmetries of the quark model and of some gauge interaction models. These models are also
problematic. In this case there are fewer physical observables for a composite system than those
one can naively construct out of those of the subsystems; in other words there are many
non-trivial constraints to be satisfied. As far as I know, no satisfying theory based on $\mathbb{H}$, consistent
with locality and special relativity, has been constructed.

3.6 Superselection sectors 

3.6.1 Definition 

In the general infinite dimensional (complex) case the decomposition of an algebra of
observables $\mathcal{A}$ along its center $Z(\mathcal{A})$ goes in a similar way as in the finite dimensional case. One
can write something like

$\mathcal{A} = \int_c \mathcal{A}_c$ (3.6.1)

where each $\mathcal{A}_c$ is a simple C*-algebra.

A very important difference with the finite dimensional case is that an infinite dimensional
C*-algebra $\mathcal{A}$ has in general many inequivalent irreducible representations in a Hilbert space.
Two different irreducible representations $\pi_1$ and $\pi_2$ of $\mathcal{A}$ in two subspaces $\mathcal{H}_1$ and $\mathcal{H}_2$ of a
Hilbert space $\mathcal{H}$ are generated by two unitarily inequivalent pure states $\varphi_1$ and $\varphi_2$ of $\mathcal{A}$. Each
irreducible representation $\pi_i$ and the associated Hilbert space $\mathcal{H}_i$ is called a superselection sector.
The great Hilbert space $\mathcal{H}$ generated by all the unitarily inequivalent pure states on $\mathcal{A}$ is the
direct sum of all superselection sectors. The operators in $\mathcal{A}$ do not mix the different
superselection sectors. It is however often very important to consider the operators in $B(\mathcal{H})$ which
mix the different superselection sectors of $\mathcal{A}$ while respecting the structure of the algebra $\mathcal{A}$
(i.e. its symmetries). Such operators are called intertwiners.

3.6.2 A simple example: the particle on a circle 

One of the simplest examples is the non-relativistic particle on a one dimensional circle. Let
us first consider the particle on a line. The two conjugate operators $Q$ and $P$ obey the canonical
commutation relations

$[Q, P] = i$ (3.6.2)
They are unbounded, but their exponentials

$U(k) = \exp(ikQ)$ , $V(x) = \exp(ixP)$ (3.6.3)

generate a C*-algebra. Now a famous theorem by Stone and von Neumann states that all
representations of their commutation relations are unitarily equivalent. In other words, there
is only one way to quantize the particle on the line, given by canonical quantization and the
standard representation of the operators acting on the Hilbert space of functions on $\mathbb{R}$.

$Q = x$ , $P = \frac{1}{i}\frac{\partial}{\partial x}$ (3.6.4)

Now, if the particle is on a circle with radius 1, the position $x$ becomes an angle $\theta$ defined
mod $2\pi$. The operator $U(k)$ is defined only for integer momenta $k = n$, $n \in \mathbb{Z}$. The
corresponding algebra of operators has now inequivalent irreducible representations, indexed
by a number $\Phi$. Each representation $\pi_\Phi$ corresponds to the representation of the $Q$ and $P$
operators acting on the Hilbert space $\mathcal{H}$ of functions $\psi(\theta)$ on the circle as

$Q = \theta$ , $P = \frac{1}{i}\frac{\partial}{\partial\theta} + A$ , $A = \frac{\Phi}{2\pi}$ (3.6.5)

So each superselection sector describes the quantum dynamics of a particle with unit charge
$e = 1$ on a circle with a magnetic flux $\Phi$. No global unitary transformation (acting on the Hilbert
space of periodic functions on the circle) can map one superselection sector onto another one.
Indeed this would correspond to the unitary transformation

$\psi(\theta) \to \psi(\theta)\, e^{i\theta\,\Delta A}$ (3.6.6)

and there is a topological obstruction if $\Delta A$ is not an integer. Here the different superselection
sectors describe different "topological phases" of the same quantum system.
This is of course nothing but the famous Aharonov-Bohm effect.
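The topological obstruction in (3.6.6) is simply the requirement that the phase factor be a single-valued function on the circle. A minimal numerical check, in plain Python (the function name, sample count and tolerance are illustrative assumptions):

```python
import cmath
import math

def is_single_valued(delta_a, samples=16, tol=1e-9):
    """True if theta -> exp(i*theta*delta_a) takes the same value at theta and theta + 2*pi."""
    for k in range(samples):
        theta = 2 * math.pi * k / samples
        g0 = cmath.exp(1j * theta * delta_a)
        g1 = cmath.exp(1j * (theta + 2 * math.pi) * delta_a)
        if abs(g1 - g0) > tol:
            return False
    return True

checks = {delta_a: is_single_valued(delta_a) for delta_a in (1, 2, 0.5, 0.3)}
print(checks)
```

Only integer shifts of $A$ give a well-defined unitary map on periodic functions, so sectors whose $A$ differ by a non-integer amount are unitarily inequivalent.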

3.6.3 General discussion 

The notion of superselection sector was first introduced by Wick, Wightman and Wigner in
1952. They observed (and proved) that it is meaningless in a quantum field theory like QED
to speak of the superposition of two states $\psi_1$ and $\psi_2$ with integer and half integer total spin
respectively, since a rotation by $2\pi$ changes by $(-1)$ the relative phase between these two states,
but does not change anything physically. This apparent paradox disappears when one realizes
that this situation is similar to the one above. No physical observable allows one to distinguish a linear
superposition of two states in different superselection sectors, such as $|1\ \text{fermion}\rangle + |1\ \text{boson}\rangle$,
from a statistical mixture of these two states $|1\ \text{fermion}\rangle\langle 1\ \text{fermion}|$ and $|1\ \text{boson}\rangle\langle 1\ \text{boson}|$.
Indeed, any operator creating or destroying just one fermion is not a physical operator (but
rather an intertwining operator), but of course an operator creating or destroying a pair of
fermions (or rather a fermion-antifermion pair) is physical.

Superselection sectors are an important feature of the mathematical formulation of
quantum field theories, but they also have a physical significance. One encounters superselection
sectors in quantum systems with an infinite number of states (non-relativistic or relativistic) as
soon as

- the system may be in different phases (for instance in a statistical quantum system with
spontaneous symmetry breaking);

- the system has global or local gauge symmetries and sectors with different charges $Q_a$
(abelian or non abelian);

- the system contains fermions;

- the system may exhibit different inequivalent topological sectors; this includes the simple
case of a particle on a ring discussed above (the Aharonov-Bohm effect), but also gauge
theories with $\theta$-vacua;

- more generally, a given QFT for different values of couplings or masses of particles may
correspond to different superselection sectors of the same algebra;

- superselection sectors have also been used to discuss measurements in quantum
mechanics and the quantum-to-classical transition.

Thus one should keep in mind that the abstract algebraic formalism contains as a whole the dif- 
ferent possible states, phases and dynamics of a quantum system, while a given representation 
describes a subclass of states or of possible dynamics. 

3.7 von Neumann algebras 

A special class of C*-algebras, the so-called von Neumann algebras or W*-algebras, is of 
special interest in mathematics and for physical applications. As far as I know these were the 
algebras of operators originally studied by Murray and von Neumann (the ring of operators). 
Here I just give some definitions and some motivations, without details or applications. 

3.7.1 Definitions 

There are several equivalent definitions, I give here three classical definitions. The first two 
refer to an explicit representation of the algebra as an algebra of operators on a Hilbert space, 
but the definition turns out to be independent of the representation. The third one depends 
only on the abstract definition of the algebra. 

Weak closure: A unital *-subalgebra $\mathcal{A}$ of the algebra of bounded operators $\mathcal{L}(\mathcal{H})$ on a
complex Hilbert space $\mathcal{H}$ is a W*-algebra iff $\mathcal{A}$ is closed under the weak topology, namely if for any
sequence $A_n$ in $\mathcal{A}$, whenever the individual matrix elements $\langle x|A_n|y\rangle$ converge towards some matrix
element $A_{xy}$, this defines an operator in the algebra

$\forall x, y \in \mathcal{H}$ , $\langle x|A_n|y\rangle \to A_{xy}$ $\implies$ $\exists A \in \mathcal{A}$ such that $\langle x|A|y\rangle = A_{xy}$ (3.7.1)

NB: The weak topology considered here can be replaced in the definition by stronger
topologies on $\mathcal{L}(\mathcal{H})$. In the particular case of commutative algebras, one can show that W*-algebras
correspond to the set of measurable functions $L^\infty(X)$ on some measurable space $X$, while
C*-algebras correspond to the set $C_0(Y)$ of continuous functions on some Hausdorff space $Y$.
Thus, as advocated by A. Connes, W*-algebras correspond to non-commutative measure
theory, while C*-algebras correspond to non-commutative topology.

The bicommutant theorem: A famous theorem by von Neumann states that $\mathcal{A} \subset \mathcal{L}(\mathcal{H})$ is a
W*-algebra iff it is a C*-algebra and it is equal to its bicommutant

$\mathcal{A} = \mathcal{A}''$ (3.7.2)

(the commutant $\mathcal{A}'$ of $\mathcal{A}$ is the set of operators that commute with all the elements of $\mathcal{A}$, and
the bicommutant is the commutant of the commutant).

NB: The equivalence of this "algebraic" definition with the previous "topological" or
"analytical" one illustrates the deep relation between algebra and analysis at work in operator algebras
and in quantum physics. It is often stated that this property means that a W*-algebra $\mathcal{A}$ is a
symmetry algebra (since $\mathcal{A}$ is the algebra of symmetries of $\mathcal{B} = \mathcal{A}'$). But one can also view
this as the fact that a W*-algebra is a "causally complete" algebra of observables, in analogy
with the notion of causally complete domain (see the next section on algebraic quantum field
theory).

Francois David, 2012. Lecture notes - November 27, 2012

The predual property: It was shown by Sakai that W*-algebras can also be defined as
C*-algebras that have a predual, i.e. when considered as a Banach vector space, $\mathcal{A}$ is the dual
of another Banach vector space $\mathcal{B}$ ($\mathcal{A} = \mathcal{B}^*$).

NB: This definition is unique up to isomorphisms, since $\mathcal{B}$ can be viewed as the set of all
(ultraweakly) continuous linear functionals on $\mathcal{A}$, which is generated by the positive normal linear
functionals on $\mathcal{A}$ (i.e. the states) with adequate topology. So W*-algebras are also algebras with
special properties for their states.

3.7.2 Classification of factors 

A word on the famous classification of factors. Factors are W*-algebras with trivial center
$\mathcal{C} = \mathbb{C}$, and any W*-algebra can be written as an integral sum over factors. W*-algebras have the
property that they are entirely determined by their projector elements (a projector is such that
$a = a^* = a^2$, and corresponds to an orthogonal projection onto a closed subspace $E$ of $\mathcal{H}$). The
famous classification result of Murray and von Neumann states that there are basically three
different classes of factors, depending on the properties of the projectors and on the existence
of a trace.

Type I: A factor is of type I if there is a minimal projector $E$ such that there is no other projector
$F$ with $0 < F < E$. Type I factors always correspond to the whole algebra of bounded operators
$\mathcal{L}(\mathcal{H})$ on some (separable) Hilbert space $\mathcal{H}$. Minimal projectors are projectors on pure states
(vectors in $\mathcal{H}$). This is the case usually considered by "ordinary physicists". They are denoted
$I_n$ if $\dim(\mathcal{H}) = n$ (matrix algebra) and $I_\infty$ if $\dim(\mathcal{H}) = \infty$.

Type II: Type II factors have no minimal projectors, but finite projectors, i.e. any projector $E$
can be decomposed into $E = F + G$ where $E$, $F$ and $G$ are equivalent projectors. The type II$_1$
hyperfinite factor has a unique finite trace $\omega$ (a state such that $\omega(1) = 1$ and $\omega(aa^*) = \omega(a^*a)$),
while type II$_\infty$ = II$_1 \otimes$ I$_\infty$. They play an important role in non-relativistic statistical mechanics
of infinite systems, the mathematics of integrable systems and CFT.

Type III: This is the most general class. Type III factors have no minimal projectors and no 
trace. They are more complicated. Their classification was achieved by A. Connes. These are 
the general algebras one must consider in relativistic quantum field theories. 

3.7.3 The Tomita-Takesaki theory 

Let me say a few words on an important feature of von Neumann algebras, which states that
there is a natural "dynamical flow" on these algebras induced by the states. This will be very
sketchy and naive. We have seen that in "standard quantum mechanics" (corresponding to a
type I factor), the evolution operator $U(t) = \exp(-itH)$ is well defined in the lower half plane
$\mathrm{Im}(t) < 0$.

This correspondence "state $\leftrightarrow$ dynamics" can be generalized to any von Neumann algebra,
even when the concepts of density matrix and trace are not valid any more. Tomita and Takesaki
showed that to any state $\varphi$ on $\mathcal{A}$ (through the GNS construction $\varphi(a) = \langle\Omega|a\Omega\rangle$, where $\Omega$ is
a separating cyclic vector of the Hilbert space $\mathcal{H}$) one can associate a one parameter family
of modular automorphisms $\sigma^\varphi_t : \mathcal{A} \to \mathcal{A}$, such that $\sigma^\varphi_t(a) = \Delta^{it} a \Delta^{-it}$, where $\Delta$ is a positive
self-adjoint modular operator. This group depends on the choice of the state $\varphi$ only up to
inner automorphisms, i.e. unitary transformations $u_t$ such that $\sigma^\psi_t(a) = u_t \sigma^\varphi_t(a) u_t^{-1}$, with the
1-cocycle property $u_{s+t} = u_s \sigma_s(u_t)$.
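For a type I factor the modular flow can be written down explicitly. The following is a minimal sketch in plain Python, assuming the algebra of 2x2 complex matrices with a faithful state $\varphi(a) = \mathrm{tr}(\rho a)$ and a diagonal density matrix $\rho = \mathrm{diag}(p)$; in that case the modular automorphism takes the standard form $\sigma_t(a) = \rho^{it} a \rho^{-it}$. The function names and numerical values are illustrative and not taken from these notes.

```python
import cmath
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def modular_flow(p, a, t):
    """sigma_t(a) = rho^{it} a rho^{-it} for rho = diag(p), with all p[j] > 0."""
    n = len(p)
    return [[cmath.exp(1j * t * (math.log(p[j]) - math.log(p[k]))) * a[j][k]
             for k in range(n)] for j in range(n)]

def state(p, a):
    """phi(a) = tr(rho a) for rho = diag(p)."""
    return sum(p[j] * a[j][j] for j in range(len(p)))

def max_diff(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

p = [0.7, 0.3]
a = [[1, 2 + 1j], [0.5j, -1]]
b = [[0, 1], [1, 3]]
t = 0.8

# The state is invariant under its own modular flow.
invariance_error = abs(state(p, modular_flow(p, a, t)) - state(p, a))
# The flow is an automorphism: sigma_t(ab) = sigma_t(a) sigma_t(b).
automorphism_error = max_diff(modular_flow(p, matmul(a, b), t),
                              matmul(modular_flow(p, a, t), modular_flow(p, b, t)))
print(invariance_error, automorphism_error)
```

Writing $\rho = e^{-\beta H}/Z$, the flow becomes $\sigma_t(a) = e^{-i\beta t H} a\, e^{i\beta t H}$, i.e. ordinary Heisenberg evolution with a rescaled time: the state determines a dynamics, which is the correspondence generalized by Tomita and Takesaki.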

As advocated by A. Connes, this means that there is a "global dynamical flow" acting on the 
von Neumann algebra A (modulo unitaries reflecting the choice of initial state). This Tomita- 
Takesaki theory is a very important tool in the mathematical theory of operator algebras. It has 
been speculated by some authors that there is a deep connection between statistics and time 
(the so called "thermal time hypothesis"), with consequences in quantum gravity. Without 
going to this point, this comforts the point of view that operator algebras have a strong link 
with causality. 

3.8 Locality and algebraic quantum field theory 

Up to now I have not really discussed the concepts of time and of dynamics, and the role of 
relativistic invariance and locality in the quantum formalism. One should remember that the 
concepts of causality and of reversibility are already incorporated within the formalism from 
the start. 

It is not really meaningful to discuss these issues if not in a fully relativistic framework. 
This is the object of algebraic and axiomatic quantum field theory. Since I am not a specialist 
I give only a very crude and very succinct account of this formalism and refer to the excellent 
book by R. Haag [Haa96] for all the details and the mathematical concepts.

3.8.1 Algebraic quantum field theory in a nutshell

In order to make the quantum formalism compatible with special relativity, one needs three 
things. 

Locality: Firstly the observables must be built on the local observables, i.e. the observables
attached to bounded domains $O$ of Minkowski space-time $M = \mathbb{R}^{1,d-1}$. They correspond to
measurements made by actions on the system in a finite region of space, during a finite interval
of time. Therefore one associates to each domain $O \subset M$ a subalgebra $\mathcal{A}(O)$ of the algebra of
observables.

$O \to \mathcal{A}(O) \subset \mathcal{A}$ (3.8.1)

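This algebra is such that

$\mathcal{A}(O_1 \cup O_2) = \mathcal{A}(O_1) \vee \mathcal{A}(O_2)$ (3.8.2)

where $\vee$ means the union of the two subalgebras (the intersection of all subalgebras containing
$\mathcal{A}(O_1)$ and $\mathcal{A}(O_2)$).
Note that this implies

$O_1 \subset O_2 \implies \mathcal{A}(O_1) \subset \mathcal{A}(O_2)$ (3.8.3)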

The local operators are obtained by taking the limit when a domain reduces to a point (this is 
not a precise or rigorous definition, in particular in view of the UV divergences of QFT and the 
renormalization problems). 

Caution, the observables of two disjoint domains are not independent if these domains are 
not causally independent (see below) since they can be related by dynamical/ causal evolution. 
Figure 3.1: The union of two domains




Figure 3.2: For two causally separated domains, the associated observables must commute 



Causality: Secondly causality and locality must be respected; this implies that physical local
observables which are causally independent must always commute. Indeed the result of
measurements of causally independent observables is always independent of the order in which
they are performed, independently of the state of the system. Were this not the case, the
observables would not be independent and through some measurement process information could be
manipulated and transported at a faster than light pace. If $O_1$ and $O_2$ are causally separated
(i.e. any $x_1 - x_2$, with $x_1 \in O_1$, $x_2 \in O_2$, is space-like) then any pair of operators $A_1$ and $A_2$,
respectively in $\mathcal{A}(O_1)$ and $\mathcal{A}(O_2)$, commutes

$O_1$, $O_2$ causally separated , $A_1 \in \mathcal{A}(O_1)$ , $A_2 \in \mathcal{A}(O_2)$ $\implies$ $[A_1, A_2] = 0$ (3.8.4)

This is the crucial requirement to enforce locality in the quantum theory.

NB: As already discussed, in theories with fermions, fermionic field operators like $\psi$ and $\bar\psi$ are
not physical operators, since they intertwine different sectors (the bosonic and the fermionic
one) and hence the anticommutation of fermionic operators does not contradict the above rule.
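The commutation requirement (3.8.4) has a very simple finite dimensional caricature. The sketch below (plain Python) assumes a toy model in which two causally separated regions are represented by the two factors of a tensor product of two-level systems; this is only an illustration of the commutation rule, not a model of a quantum field theory, and all names are hypothetical.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def kron(a, b):
    """Kronecker (tensor) product of two square matrices given as nested lists."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(ab, ba)]

def is_zero(m):
    return all(x == 0 for row in m for x in row)

I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Z = [[1, 0], [0, -1]]

a1 = kron(X, I2)   # observable localized in region 1
a2 = kron(I2, Z)   # observable localized in region 2
b1 = kron(Z, I2)   # another observable localized in region 1

separated_commute = is_zero(commutator(a1, a2))
same_region_commute = is_zero(commutator(a1, b1))
print(separated_commute, same_region_commute)
```

Observables attached to different factors always commute, whatever they are, while observables attached to the same region need not.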



Causal completion: One needs also to assume causal completion, i.e.

$\mathcal{A}(O) = \mathcal{A}(\hat{O})$ (3.8.5)

where the domain $\hat{O}$ is the causal completion of the domain $O$ ($\hat{O}$ is defined as the set of points
$O''$ which are causally separated from the points of $O'$, the set of points causally separated
from the points of $O$; see fig. 3.3 for a self-explanatory illustration).

This implies in particular that the whole algebra $\mathcal{A}$ is the (inductive) limit of the subalgebras
generated by an increasing sequence of bounded domains whose union is the whole Minkowski
space

$O_i \subset O_j$ if $i < j$ and $\bigcup_i O_i = M$ $\implies$ $\lim_i \mathcal{A}(O_i) = \mathcal{A}$ (3.8.6)
Figure 3.3: A domain $O$ and its causal completion $\hat{O}$ (in gray)

and also that it is equal to the algebra associated to "time slices" with arbitrarily small time
width.

$S_\epsilon = \{x = (t, \vec{x}) : t_0 < t < t_0 + \epsilon\}$ (3.8.7)

Figure 3.4: An arbitrarily thin space-like slice of space-time is enough to generate the algebra of
observables $\mathcal{A}$

This indicates also why one should concentrate on von Neumann algebras. The set of local
subalgebras $\mathcal{C} = \{\mathcal{A}(O) : O \text{ subdomain of } M\}$ forms an orthocomplemented lattice with
interesting properties.

Poincaré invariance: The Poincaré group $P(1, d-1) = \mathbb{R}^{1,d-1} \rtimes O(1, d-1)$ must act on the
space of local observables, so that it corresponds to a symmetry of the theory (the theory must
be covariant under translations in space and time and Lorentz transformations). When $\mathcal{A}$ is
represented as an algebra of operators on a Hilbert space, the action is usually represented
by unitary⁵ transformations $U(a, \Lambda)$ ($a$ being a translation and $\Lambda$ a Lorentz transformation).
This implies in particular that the algebra associated to the image of a domain by a Poincaré
transformation is the image of the algebra under the action of the Poincaré transformation.

$U(a, \Lambda)\, \mathcal{A}(O)\, U^{-1}(a, \Lambda) = \mathcal{A}(\Lambda O + a)$ (3.8.8)

The generator of time translations will be the Hamiltonian $P_0 = H$, and time translations
acting on observables correspond to the dynamical evolution of the system in the Heisenberg
picture, in a given Lorentzian reference frame.

The vacuum state: Finally one needs to assume the existence (and the uniqueness, in the
absence of spontaneous symmetry breaking) of a special state, the vacuum state $|\Omega\rangle$. The vacuum
state must be invariant under the action of the Poincaré transformations, i.e. $U(a, \Lambda)|\Omega\rangle = |\Omega\rangle$.

5. Unitary with respect to the real algebra structure, i.e. unitary or antiunitary w.r.t. the complex algebra struc- 
ture. 
Figure 3.5: The Poincaré group acts on the domains and on the associated algebras

At least in the vacuum sector, the spectrum of $P = (E, \vec{P})$ (the generators of time and space
translations) must lie in the future cone.

$E^2 - \vec{P}^2 \ge 0$ , $E \ge 0$ (3.8.9)

This is required since the dynamics of the quantum states must respect causality. In particular,
the condition $E \ge 0$ (positivity of the energy) implies that dynamical evolution is compatible
with the modular automorphisms on the algebra of observables constructed by the
Tomita-Takesaki theory.

3.8.2 Axiomatic QFT 

3.8.2.a - Wightman axioms 

One approach to implement the program of algebraic local quantum field theory is the
so-called axiomatic field theory framework (Wightman & Garding). Actually the axiomatic field
theory program was started before the algebraic one. In this formalism, besides the axioms
of local AQFT, the local operators are realized as "local fields". These local fields $\Phi$ are
represented as distributions (over space-time $M$) whose values, when applied to some $C^\infty$ test
function $f$ with compact support (typically inside some $O$), are operators $a = \langle\Phi \cdot f\rangle$. Local
fields are thus "operator valued distributions". They must satisfy the Wightman axioms (see
Streater and Wightman's book [SA00] and R. Haag's book, again), which enforce causality,
locality, Poincaré covariance, existence (and uniqueness) of the vacuum (and possibly in
addition asymptotic completeness, i.e. existence of a scattering S-matrix).

3.8.2.b - CPT and spin-statistics theorems 

The axiomatic framework is very important for the definition of quantum theories. It is 
within this formalism that one can derive the general and fundamental properties of relativistic 
quantum theories 

- Reconstruction theorem: reconstruction of the Hilbert space of states from the vacuum
expectation values of products of local fields (the Wightman functions, or correlation
functions),

- Derivation of the analyticity properties of the correlation functions with respect to
space-time $x = (t, \vec{x})$ and momentum $p = (E, \vec{p})$ variables,

- Analyticity of the S matrix (an essential tool),

- The CPT theorem: locality, Lorentz invariance and unitarity imply CPT invariance,

- The spin statistics theorem,

- Definition of quantum field theories in Euclidean time (Osterwalder-Schrader axioms)
and rigorous formulation of the mapping between Euclidean theories and Lorentzian
quantum theories.

3.9 Discussion 

I gave here a short introduction to the algebraic formulation of quantum mechanics and 
quantum field theory. I did not aim at mathematical rigor nor completeness. I have not men- 
tioned recent developments and applications in the direction of gauge theories, of two dimen- 
sional conformal field theories, of quantum field theory in non trivial (but classical) gravita- 
tional background. 

However I hope to have conveyed the idea that the "canonical structure of quantum me- 
chanics" - complex Hilbert space of states, algebra of operators, Born rule for probabilities - 
is quite natural and is a representation of an underlying more abstract structure: a real alge- 
bra of observables + states, consistent with the physical concepts of causality, reversibility and 
locality / separability. 
Chapter 4

The quantum logic formalism 



4.1 Introduction: measurements as logic 

The quantum logic formalism is another interesting, albeit more abstract, way to formulate 
quantum physics. The bonus of this approach is that one does not have to assume that the set 
of observables of a physical system is endowed with the algebraic structure of an associative
unital algebra. As we have discussed in the previous section, the fact that one can "add" and 
"multiply" observables is already a highly non trivial assumption. This algebraic structure is 
natural in classical physics since observables form a commutative algebra, coming from the 
action of adding and multiplying results of different measurements. In quantum physics this 
is not equivalent, and we have seen for instance that the GNS construction relates the algebra 
structure of observables to the Hilbert space structure of pure states. In particular to the super- 
position principle for states comes from the addition law for observables. In the quantum logic 
formulations this algebraic structure itself comes out somehow naturally from the symmetries 
of the measurement operations considered on the physical system. 

The "quantum logic" approach was initiated by G. Birkhoff^1 and J. von Neumann (again!)
in [BvN36]. It was then (slowly) developed, notably by physicists like G. Mackey [Mac63],
J. M. Jauch [Jau68] and C. Piron [Pir64, Pir76], and mathematicians like Varadarajan [Var85]. A
good reference on the subject (not very recent but very valuable) is the book by E. Beltrametti
and G. Cassinelli [BC81].

The terminology "quantum logic" for this approach is historical and is perhaps not fully
adequate, since it does not mean that a new kind of logic is necessary to understand quantum
physics. It is in fact not a "logic" in the mathematical sense, and it relies on the standard logic
used in mathematics and exact sciences. It could rather be called "quantum propositional
calculus" or "quantum propositional geometry", where the term "proposition" is to be understood
as "test" or "projective measurement" on a quantum system. The mathematics underlying the
quantum logic formalism has applications in various areas of mathematics, logic and computer
science. The quantum logic approaches do not form a unified, precise and consistent
framework like algebraic quantum field theory. There are several variants, most of them insisting
on propositions, but some older ones relying more on the concept of states (the so-called convex
set approaches). Some recent formulations of quantum physics related to quantum logic have
some grandiose categorical formulations.

In this course I shall give a short, partial presentation of this approach, from a personal point
of view.^2 I shall try to stress where the physical concepts of causality, reversibility and locality

1. An eminent mathematician, not to be confused with his father, the famous G. D. Birkhoff of the ergodic 
theorem 

2. with the usual reservation on the lecturer's qualifications 
play a role, in parallel to what I tried to do for the algebraic formalism. My main reference and
source of understanding is the review by Beltrametti and Cassinelli [BC81].

The idea at the root of this approach goes back to J. von Neumann's book [vN55, vN32]. It
starts from the observation that the observables given by projectors, i.e. operators $\mathbf{P}$ such that
$\mathbf{P}^2 = \mathbf{P} = \mathbf{P}^\dagger$, correspond to propositions with YES or NO (i.e. TRUE or FALSE) outcome in a
logical system. An orthogonal projector $\mathbf{P}$ onto a linear subspace $P \subset \mathcal{H}$ is indeed the operator
associated to an observable that can take only the values 1 (and always 1 if the state $\psi \in P$ is
in the subspace $P$) or 0 (and always 0 if the state $\psi \in P^\perp$ belongs to the orthogonal subspace
to $P$). Thus we can consider that measuring the observable $\mathbf{P}$ is equivalent to performing a test on
the system, or to checking the validity of a logical proposition $p$ on the system.

$\mathbf{P}$ = orthogonal projector onto $P$ $\leftrightarrow$ proposition $p$ (4.1.1)

If the result is 1 the proposition $p$ is found to be TRUE, and if the result is 0 the proposition $p$ is
found to be FALSE.

$\langle\psi|\mathbf{P}|\psi\rangle = 1 \implies p$ always TRUE on $|\psi\rangle$ (4.1.2)

The projector $1 - \mathbf{P}$ onto the orthogonal subspace $P^\perp$ is associated to the proposition not $p$,
meaning usually that $p$ is false (assuming the law of excluded middle)

$\langle\psi|\mathbf{P}|\psi\rangle = 0 \implies p$ always FALSE on $|\psi\rangle$ (4.1.3)

so that

$1 - \mathbf{P}$ = orthogonal projector onto $P^\perp$ $\leftrightarrow$ proposition not $p$ (4.1.4)

In classical logic the negation not is denoted in various ways 

not $a$ = $\neg a,\ a',\ \bar{a},\ \tilde{a},\ {\sim} a$ (4.1.5)

I shall use the first two notations. 

Now if two projectors $\mathbf{A}$ and $\mathbf{B}$ (on two subspaces $A$ and $B$) commute, they correspond to
classically compatible observables $\mathbf{A}$ and $\mathbf{B}$ (which can be measured independently), and to a
pair of propositions $a$ and $b$ of standard logic. The projector $\mathbf{C} = \mathbf{A}\mathbf{B} = \mathbf{B}\mathbf{A}$ on the intersection
of the two subspaces $C = A \cap B$ corresponds to the proposition $c$ = "$a$ and $b$" = $a \wedge b$.
Similarly the projector $\mathbf{D}$ on the linear sum of the two subspaces $D = A + B$ corresponds to the
proposition $d$ = "$a$ or $b$" = $a \vee b$.

$A \cap B \leftrightarrow a \wedge b$ = $a$ and $b$ , $A + B \leftrightarrow a \vee b$ = $a$ or $b$ (4.1.6)

Finally, the inclusion of subspaces $A \subset B$, i.e. for projectors $\mathbf{A}\mathbf{B} = \mathbf{B}\mathbf{A} = \mathbf{A}$, is equivalent to
the statement that $a$ implies $b$

$A \subset B \leftrightarrow a \implies b$ (4.1.7)

This is easily extended to a general (possibly infinite) set of commuting projectors. Such a set
generates a commuting algebra of observables $\mathcal{A}$, which corresponds to the algebra of functions
on some classical space $X$. The set of corresponding subspaces, with the operations of linear
sum, intersection and orthocomplementation $(+, \cap, \perp)$, is isomorphic to a Boolean algebra of
propositions with $(\vee, \wedge, \neg)$, or to the algebra of characteristic functions on subsets of $X$. Indeed,
this is just a reformulation of "ordinary logic",^3 where characteristic functions of measurable
sets (in a Borel $\sigma$-algebra over some set $X$) can be viewed as logical propositions. Classically
all the observables of some classical system (measurable functions over its phase space $\Omega$) can
be constructed out of the classical propositions on the system (the characteristic functions of
measurable subsets of $\Omega$).

3. In a very loose sense, I am not discussing mathematical logic theory. 
In quantum mechanics all physical observables can be constructed out of projectors. For
general, not necessarily commuting projectors $\mathbf{A}$ and $\mathbf{B}$ on subspaces $A$ and $B$ one still associates
propositions $a$ and $b$. The negation $\neg a$, the "and" (or "meet") $a \wedge b$ and the "or" (or "join")
$a \vee b$ are still defined by the geometrical operations $\perp$, $\cap$ and $+$ on subspaces given by (4.1.6).
The "implies" $\implies$ is also defined by the inclusion $\subset$ as in (4.1.7).

However the fact that in a Hilbert space projectors do not necessarily commute implies that
the standard distributivity law of propositions

$A \wedge (B \vee C) = (A \wedge B) \vee (A \wedge C)$ , $\vee$ = or , $\wedge$ = and (4.1.8)

does not hold. It is replaced by the weaker condition ($A$, $B$, $C$ are the linear subspaces associated
to the projectors $\mathbf{A}$, $\mathbf{B}$, $\mathbf{C}$)

$A \cap (B + C) \supset ((A \cap B) + (A \cap C))$ (4.1.9)

which corresponds in terms of propositions (projectors) to

$(a \wedge b) \vee (a \wedge c) \implies a \wedge (b \vee c)$ (4.1.10)

or equivalently

$a \vee (b \wedge c) \implies (a \vee b) \wedge (a \vee c)$ (4.1.11)



A simple example is depicted in Fig. 4.2. The vector space $V$ is the plane (dim = 2) and the
subspaces $A$, $B$ and $C$ are three different coplanar lines (dim = 1). $B + C = V$, hence
$A \cap (B + C) = A \cap V = A$, while $A \cap B = A \cap C = \{0\}$; hence $(A \cap B) + (A \cap C) = \{0\}$.
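
The counterexample can be checked mechanically. The following Python fragment is only a minimal sketch under stated assumptions: the plane is modelled as $\mathbb{Q}^2$ with the standard dot product, subspaces are stored as reduced row echelon bases, and the helper names (rref, perp, join, meet) as well as the particular choice of the three lines are illustrative, not taken from these notes.

```python
from fractions import Fraction

def rref(rows, n):
    """Canonical basis (reduced row echelon form) of the span of `rows` in Q^n."""
    m = [[Fraction(x) for x in row] for row in rows]
    r = col = 0
    while r < len(m) and col < n:
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:          # no pivot in this column, try the next one
            col += 1
            continue
        m[r], m[pivot] = m[pivot], m[r]
        lead = m[r][col]
        m[r] = [x / lead for x in m[r]]
        for i in range(len(m)):    # clear the pivot column in all other rows
            if i != r and m[i][col] != 0:
                factor = m[i][col]
                m[i] = [x - factor * y for x, y in zip(m[i], m[r])]
        r += 1
        col += 1
    return tuple(tuple(row) for row in m[:r])

def perp(basis, n):
    """Orthogonal complement for the standard dot product (null space of `basis`)."""
    pivots = [next(j for j, x in enumerate(row) if x != 0) for row in basis]
    vectors = []
    for free in (j for j in range(n) if j not in pivots):
        v = [Fraction(0)] * n
        v[free] = Fraction(1)
        for row, p in zip(basis, pivots):
            v[p] = -row[free]
        vectors.append(v)
    return rref(vectors, n)

def join(a, b, n):
    """Linear sum A + B."""
    return rref(list(a) + list(b), n)

def meet(a, b, n):
    """Intersection, computed as (A^perp + B^perp)^perp."""
    return perp(join(perp(a, n), perp(b, n), n), n)

N = 2
A = rref([[1, 1]], N)   # three distinct lines through the origin (illustrative choice)
B = rref([[1, 0]], N)
C = rref([[0, 1]], N)

lhs = meet(A, join(B, C, N), N)                 # A ∩ (B + C)
rhs = join(meet(A, B, N), meet(A, C, N), N)     # (A ∩ B) + (A ∩ C)
print("A ∩ (B + C)       =", lhs)
print("(A ∩ B) + (A ∩ C) =", rhs)
```

An empty tuple stands for the zero subspace $\{0\}$. Since the echelon basis is canonical, two subspaces are equal exactly when their tuples are equal, so comparing lhs and rhs tests distributivity directly.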

Therefore the set of projectors on a Hilbert space does not generate a Boolean algebra. The
purpose of the quantum logic approach is to try to understand what is the minimal set of
consistency requirements on such propositions/measurements, based on logical consistency
(assuming that internal consistency has something to do with the physical world), and on physical
requirements (in particular causality, reversibility and locality), and what are the consequences
for the formulation of physical laws. I discuss the conservative approach where one does not
try to use a non-classical logic (whatever it means) but discusses in a classical logic framework
the statements which can be made on quantum systems.

There are many variants of the formalism: some insist on the concept and the properties of
the propositions (the tests), some others on those of the states (the probabilities). They are often
equivalent. Here I present a version based primarily on the propositions.
Figure 4.2: A simple example of non-distributivity



4.2 A presentation of the principles 

4.2.1 Projective measurements as propositions 

As explained above, in the standard formulation of quantum mechanics, projectors are
associated to "ideal" projective measurements (projective measurements "of the first kind", or
"non-demolition" projective measurements). The fundamental property of such measurements
is that if the system is already in an eigenstate of the projector, for instance $\mathbf{P}|\psi\rangle = |\psi\rangle$, then
after measurement the state of the system is unchanged. This means that successive
measurements of $\mathbf{P}$ always give the same result (1 or TRUE). Without going into a discussion of
measurements in quantum physics, let me stress that this is of course an idealisation of
actual measurements. In general physical measurements are not ideal measurements: they may
change the state of the system; while gaining some information on the system we in general
lose some other information; and they may, and in general do, destroy part or the whole of the
system studied. Such general processes may be described by the formalism of POVMs (Positive
Operator Valued Measures).

In the following presentation, I assume that such ideal repeatable measurements are (in
principle) possible for all the observable properties of a quantum system. The formalism here
tries to guess what is a natural and minimal set of physically reasonable and logically consistent 
axioms for such measurements. 

4.2.2 Causality, POSET's and the lattice of propositions 

One starts from a set of propositions or tests $\mathcal{L}$ (associated to ideal measurements of the
first kind on a physical system) and from a set of states $\mathcal{S}$ (in a similar sense as in the algebraic
formulation, to be made more precise along the discussion). On a given state $\varphi$ the test
(measurement) of the proposition $a$ can give TRUE (i.e. YES or 1) or FALSE (NO or 0). It gives TRUE
with some probability. In this case one has extracted information on the system, which is now
(considered to be) in a state $\varphi_a$.

I denote by $\varphi(a)$ the probability that $a$ is found TRUE, assuming that the system was in state $\varphi$
before the test. I shall not discuss at this stage what I mean exactly by probability (see the
previous discussions).
4.2.2.a - Causal order relation:

The first ingredient is to assume that there is an order relation $a \preceq b$ between propositions.
Here it will be defined by the causal relation

$a \preceq b \iff$ for any state $\varphi$, if $a$ is found true, then $b$ will be found true (4.2.1)

Note that this definition is causal (or dynamical) from the start, as is to be expected in quantum
physics. It is equivalent to

$a \preceq b \iff \forall \varphi,\ \varphi_a(b) = 1$ (4.2.2)

One assumes that this causal relation has the usual properties of a partial order relation.
This amounts to enforcing relations between states and propositions. First one must have:

$a \preceq a$ (4.2.3)

This means that if $a$ has been found true, the system is now in a state such that $a$ will always be
found true. Second one assumes also that

$a \preceq b$ and $b \preceq c \implies a \preceq c$ (4.2.4)

This is true in particular when, if the system is in a state $\psi$ such that $b$ is always true, then after
measuring $b$, the system is still in the same state $\psi$. In other words, $\psi(b) = 1 \implies \psi_b = \psi$. This
is the concept of repeatability discussed above.

These two properties make $\preceq$ a preorder relation.

One also assumes that

$a \preceq b$ and $b \preceq a \implies a = b$ (4.2.5)

This means that tests which give the same results on any state are indistinguishable. This also
means that one can identify a proposition $a$ with the set of states such that $a$ is always found
to be true (i.e. $\psi(a) = 1$). (4.2.5) makes $\preceq$ a partial order relation and $\mathcal{L}$ a partially ordered set or
POSET.



4.2.2.b - AND (meet $\wedge$):

The second ingredient is the notion of logical conjunction AND. One assumes that for any
pair of tests $a$ and $b$, there is a unique greatest proposition $a \wedge b$ such that

$a \wedge b \preceq a$ and $a \wedge b \preceq b$ (4.2.6)

in other words, there is a unique $a \wedge b$ such that

$c \preceq a$ and $c \preceq b \implies c \preceq a \wedge b$

NB: this is a non-trivial assumption, not a simple consequence of the previous ones. It can be
justified using the notion of filters (see Jauch) or that of questions associated to propositions
(see Piron). Here to make things simpler I just present it as an assumption. On the other hand
it is very difficult to build anything without this assumption. Note that (4.2.6) implies (4.2.5).

This definition extends to any set $\mathcal{A}$ of propositions

$\bigwedge \mathcal{A} = \bigwedge \{a \in \mathcal{A}\}$ = greatest $c$ : $c \preceq a,\ \forall a \in \mathcal{A}$ (4.2.7)

I do not discuss if the set $\mathcal{A}$ is finite or countable.
4.2.2.c - Logical OR (join $\vee$):

From this we can infer the existence of a logical OR (by using Birkhoff's theorem)

$a \vee b = \bigwedge \{c : a \preceq c \text{ and } b \preceq c\}$ (4.2.8)

which extends to sets of propositions

$\bigvee \mathcal{A} = \bigwedge \{b : a \preceq b,\ \forall a \in \mathcal{A}\}$ (4.2.9)

4.2.2.d - Trivial 1 and vacuous $\emptyset$ propositions:

It is natural to assume that there is a proposition 1 that is always true

for any state $\varphi$, 1 is always found to be true, i.e. $\varphi(1) = 1$ (4.2.10)

and another proposition $\emptyset$ that is never true

for any state $\varphi$, $\emptyset$ is never found to be true, i.e. $\varphi(\emptyset) = 0$ (4.2.11)

Naturally one has

$1 = \bigvee \mathcal{L}$ and $\emptyset = \bigwedge \mathcal{L}$ (4.2.12)

With these assumptions and definitions the set of propositions $\mathcal{L}$ has now the structure of a
complete lattice.
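
To make the construction of the join from the meet concrete, here is a small Python sketch. It assumes a hypothetical five-element lattice (a vacuous proposition "0", a trivial proposition "1", and three mutually incomparable propositions a, b, c in between, as for three coplanar lines); the function names big_meet and big_join are illustrative only.

```python
# Hypothetical finite lattice: "0" below everything, "1" above everything,
# and three incomparable propositions a, b, c in between.
ELEMENTS = ["0", "a", "b", "c", "1"]
LEQ = {(x, x) for x in ELEMENTS}
LEQ |= {("0", x) for x in ELEMENTS} | {(x, "1") for x in ELEMENTS}

def leq(x, y):
    """Order relation x ≼ y, given here as an explicit set of pairs."""
    return (x, y) in LEQ

def big_meet(subset):
    """Greatest element lying below every member of `subset` (cf. 4.2.7)."""
    lower = [c for c in ELEMENTS if all(leq(c, s) for s in subset)]
    greatest = [c for c in lower if all(leq(d, c) for d in lower)]
    assert len(greatest) == 1, "the meet must exist and be unique"
    return greatest[0]

def big_join(subset):
    """Join obtained as the meet of all upper bounds (cf. 4.2.8 and 4.2.9)."""
    upper = [b for b in ELEMENTS if all(leq(s, b) for s in subset)]
    return big_meet(upper)

print("a AND b =", big_meet(["a", "b"]))
print("a OR b  =", big_join(["a", "b"]))
print("join of everything =", big_join(ELEMENTS))
print("meet of everything =", big_meet(ELEMENTS))
```

Only the order relation and the existence of meets are used; the join is never specified independently, which is the content of the argument above.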

4.2.3 Reversibility and orthocomplementation 

4.2.3.a - Negations a' and 'a

I have not yet discussed what to do if a proposition is found to be false. To do so one must
introduce the seemingly simple notion of negation or complement. In classical logic this is easy.
The subtle point is that for quantum systems, where causality matters, there are two
inequivalent ways to introduce the negation. These two definitions become equivalent only if one
assumes that propositions on quantum systems share a property of causal reversibility. In this
case, one recovers the standard negation of propositions in classical logic, and ultimately this
will lead to the notion of orthogonality and of scalar product of standard quantum mechanics.
Thus here again, as in the previous section, reversibility appears to be one of the essential
features of the principles of quantum physics.

Negation - definition 1: To any proposition $a$ one can associate its negation (or complement
proposition) $a'$ defined as

for any state $\varphi$, if $a$ is found to be true, then $a'$ will be found to be false (4.2.13)

$a'$ can be defined equivalently as

$a' = \bigvee \{b$ such that on any state $\varphi$, if $a$ is found true, then $b$ will be found false$\}$ (4.2.14)
Negation - definition 2: It is important at this stage to realize that, because of the causality
ordering in the definition, there is an alternate definition for the complement, that I denote $'a$,
given by

for any state $\varphi$, if $'a$ is found to be true, then $a$ will be found to be false (4.2.15)

or equivalently

$'a = \bigvee \{b$ such that on any state $\varphi$, if $b$ is found true, then $a$ will be found false$\}$ (4.2.16)

These two definitions are not equivalent, and they do not necessarily fulfill the properties of the
negation in classical propositional logic.^4

$\neg(\neg a) = a$ and $\neg(a \wedge b) = \neg a \vee \neg b$

These problems come from the fact that the definition of the causal order $a \preceq b$ does not
imply that $b' \preceq a'$, as in classical logic. Indeed the definition (4.2.1) of $a \preceq b$ implies that for
every state

if $b$ is found false, then $a$ was found false (4.2.17)

while $b' \preceq a'$ would mean

if $b$ is found false, then $a$ will be found false (4.2.18)

or equivalently

if $a$ is found true, then $b$ was found true (4.2.19)



4.2.3.b - Causal reversibility and negation 

In order to build a formalism consistent with what we know of quantum physics, we need
to enforce the condition that the causal order structure on propositions is in fact independent
of the choice of a causal arrow "if ..., then ... will ..." versus "if ..., then ... was ...". This
is nothing but the requirement of causal reversibility and it is enforced by the following simple
but very important condition.



Causal reversibility: One assumes that the negation $a'$ is such that

$a \preceq b \iff b' \preceq a'$ (4.2.20)

With this assumption, it is easy to show that the usual properties of negation are satisfied.
The two alternate definitions of negation are now equivalent

$a' = {}'a = \neg a$ (4.2.21)

and may be denoted by the standard logical symbol $\neg$. We then have

$(a')' = a$ (4.2.22)

and

$(a \wedge b)' = a' \vee b'$ (4.2.23)

4. The point discussed here is a priori not connected to the classical versus intuitionist logics debate. Remember 
that we are not discussing a logical system. 
as well as

$\emptyset' = 1$ , $1' = \emptyset$ (4.2.24)

and

$a \wedge a' = \emptyset$ , $a \vee a' = 1$ (4.2.25)

A lattice $\mathcal{L}$ with a complement with the properties (4.2.20)-(4.2.25) is called an
orthocomplemented complete lattice (in short OC lattice). For such a lattice, the couple $(a, a')$ describes what
is called a perfect measurement.
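
As an illustration, the properties (4.2.20)-(4.2.25) can be verified by brute force on a small example. The sketch below assumes a hypothetical six-element lattice with two incomparable pairs of complementary propositions (a, a') and (b, b') between "0" and "1"; this example and all names in the code are illustrative, not taken from these notes.

```python
from itertools import product

# Hypothetical six-element OC lattice: 0 < a, a', b, b' < 1, atoms incomparable.
ELEMENTS = ["0", "a", "a'", "b", "b'", "1"]
NEG = {"0": "1", "1": "0", "a": "a'", "a'": "a", "b": "b'", "b'": "b"}
LEQ = {(x, x) for x in ELEMENTS}
LEQ |= {("0", x) for x in ELEMENTS} | {(x, "1") for x in ELEMENTS}

def leq(x, y):
    return (x, y) in LEQ

def meet(x, y):
    """Greatest lower bound of x and y."""
    lower = [c for c in ELEMENTS if leq(c, x) and leq(c, y)]
    return next(c for c in lower if all(leq(d, c) for d in lower))

def join(x, y):
    """Least upper bound of x and y."""
    upper = [c for c in ELEMENTS if leq(x, c) and leq(y, c)]
    return next(c for c in upper if all(leq(c, d) for d in upper))

pairs = list(product(ELEMENTS, repeat=2))
triples = list(product(ELEMENTS, repeat=3))

order_reversing = all(leq(x, y) == leq(NEG[y], NEG[x]) for x, y in pairs)     # (4.2.20)
involutive = all(NEG[NEG[x]] == x for x in ELEMENTS)                          # (4.2.22)
de_morgan = all(NEG[meet(x, y)] == join(NEG[x], NEG[y]) for x, y in pairs)    # (4.2.23)
complemented = all(meet(x, NEG[x]) == "0" and join(x, NEG[x]) == "1"
                   for x in ELEMENTS)                                         # (4.2.25)
distributive = all(meet(x, join(y, z)) == join(meet(x, y), meet(x, z))
                   for x, y, z in triples)

print(order_reversing, involutive, de_morgan, complemented, distributive)
```

In this example the orthocomplementation axioms hold while distributivity fails (take x = a, y = b, z = b'), so an OC lattice need not be Boolean.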



NB: Note that in Boolean logic, the implication $\to$ can be defined from the negation $\neg$. Indeed
$a \to b$ means $\neg a \vee b$. Here it is the negation $\neg$ which is defined out of the implication.



4.2.3.c - Orthogonality

With reversibility and complement, the set of propositions starts to have properties similar
to the set of projections on linear subspaces of a Hilbert space.^5 The complement $a'$ of a
proposition $a$ is similar to the orthogonal subspace $P^\perp$ of a subspace $P$. This analogy can be extended
to the general concept of orthogonality.



Orthogonal propositions:

Two propositions $a$ and $b$ are orthogonal if $b \preceq a'$ (or equivalently $a \preceq b'$). (4.2.26)

This is denoted

$a \perp b$ (4.2.27)



Compatible propositions:

OC lattices contain also the concept of classical propositions. A subset of an OC lattice $\mathcal{L}$ is a
sublattice $\mathcal{L}'$ if it is stable under the operations $\wedge$, $\vee$ and $'$ (hence it is itself an OC lattice). To
any subset $S \subset \mathcal{L}$ one can associate the sublattice $\mathcal{L}_S$ generated by $S$, defined as the smallest
sublattice $\mathcal{L}'$ of $\mathcal{L}$ which contains $S$.

A (sub)lattice is said to be Boolean if it satisfies the distributive law of classical logic
$a \wedge (b \vee c) = (a \wedge b) \vee (a \wedge c)$.

A subset $S$ of an OC lattice is said to be a subset of compatible propositions if the generated
lattice $\mathcal{L}_S$ is Boolean.

Compatible propositions are the analog of commuting projectors, i.e. compatible or com- 
muting observables in standard quantum mechanics. For a set of compatible propositions, one 
expects that the expectations of the outcomes YES or NO will satisfy the rules of ordinary logic. 

Orthogonal projection: The notion of orthogonal projection onto a subspace can be also
formulated in this framework as

projection of $a$ onto $b$ = $\Phi_b(a) = b \wedge (a \vee b')$ (4.2.28)

This projection operation is often called the Sasaki projection. Its dual $(\Phi_b(a'))' = b' \vee (a \wedge b)$
is called the Sasaki hook $(b \to a)$. It has the property that even if $a \not\preceq b$, if for a state $\psi$ the Sasaki
hook $(a \to b)$ is always true, then for this state $\psi$, if $a$ is found true then $b$ will always be found
true.
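
The lattice formula (4.2.28) can be compared with ordinary orthogonal projection in a simple case. The Python sketch below assumes the propositions are subspaces of $\mathbb{Q}^3$ with the standard dot product, stored as reduced row echelon bases; the helper names and the chosen line and plane are hypothetical illustrations.

```python
from fractions import Fraction

def rref(rows, n):
    """Canonical basis (reduced row echelon form) of the span of `rows` in Q^n."""
    m = [[Fraction(x) for x in row] for row in rows]
    r = col = 0
    while r < len(m) and col < n:
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            col += 1
            continue
        m[r], m[pivot] = m[pivot], m[r]
        lead = m[r][col]
        m[r] = [x / lead for x in m[r]]
        for i in range(len(m)):
            if i != r and m[i][col] != 0:
                factor = m[i][col]
                m[i] = [x - factor * y for x, y in zip(m[i], m[r])]
        r += 1
        col += 1
    return tuple(tuple(row) for row in m[:r])

def perp(basis, n):
    """Orthogonal complement (the negation a') for the standard dot product."""
    pivots = [next(j for j, x in enumerate(row) if x != 0) for row in basis]
    vectors = []
    for free in (j for j in range(n) if j not in pivots):
        v = [Fraction(0)] * n
        v[free] = Fraction(1)
        for row, p in zip(basis, pivots):
            v[p] = -row[free]
        vectors.append(v)
    return rref(vectors, n)

def join(a, b, n):
    return rref(list(a) + list(b), n)

def meet(a, b, n):
    return perp(join(perp(a, n), perp(b, n), n), n)

N = 3

def sasaki(a, b):
    """Sasaki projection of a onto b, i.e. b ∧ (a ∨ b')."""
    return meet(b, join(a, perp(b, N), N), N)

plane = rref([[1, 0, 0], [0, 1, 0]], N)   # proposition b: the plane z = 0
line = rref([[1, 1, 1]], N)               # atom a, neither inside b nor inside b'
normal = rref([[0, 0, 1]], N)             # atom contained in b'

projected = sasaki(line, plane)
killed = sasaki(normal, plane)
print("projection of the line  :", projected)
print("projection of the normal:", killed)
```

The line spanned by (1, 1, 1) is sent to the line spanned by (1, 1, 0), which is its usual orthogonal projection onto the plane, while a line orthogonal to the plane is sent to the zero subspace (the empty tuple).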

5. One should be careful for infinite dimensional Hilbert spaces and general operator algebras. Projectors corre- 
spond in general to orthogonal projections on closed subspaces. 
4.2.4 Subsystems of propositions and orthomodularity
4.2.4.a - What must replace distributivity? 

The concept of orthocomplemented lattice of propositions is not sufficient to reconstruct a
consistent quantum formalism. There are mathematical reasons and physical reasons.

One reason is that, although the distributive law $A \wedge (B \vee C) = (A \wedge B) \vee (A \wedge C)$ is known not
to apply, assuming no restricted distributivity condition at all is too weak and leads to too many
possible structures. In particular, a lattice admitting an orthocomplementation $\neg$ may in general be
endowed with several inequivalent ones! This is problematic for the physical interpretation of
the complement as $a \to$ TRUE $\iff a' \to$ FALSE.

Another problem is that in physics one is led to consider conditional states and conditional
propositions. In classical physics this would correspond to the restriction to some subset $\Omega'$ of
the whole phase space $\Omega$ of a physical system, or to the projection $\Omega \to \Omega'$. Such projections or
restrictions are necessary if there are some constraints on the states of the system, if one has
access only to some subset of all the physical observables of the system, or if one is interested only
in the study of a subsystem of a larger system. In particular such a separation of the degrees of
freedom is very important when discussing locality: we are interested in the properties of the
system we can associate to (the observables measured in) a given interval of space and time,
as already discussed for algebraic QFT. It is also very important when discussing effective low
energy theories: we want to separate (project out) the (un-observable) high energy degrees of
freedom from the (observable) low energy degrees of freedom. And of course this is crucial to
discuss open quantum systems, quantum measurement processes, decoherence processes, and
the emergence of classical degrees of freedom and classical behaviors in quantum systems.

4.2.4.b - Sublattices and weak-modularity 

In general a subsystem is defined from the observables (propositions) on the system which
satisfy some constraints. One can reduce the discussion to one constraint $a$. If $\mathcal{L}$ is an
orthocomplemented lattice and $a$ a proposition of $\mathcal{L}$, let us consider the subset $\mathcal{L}_{\preceq a}$ of all propositions
which imply $a$

$\mathcal{L}_{\preceq a} = \{b \in \mathcal{L} : b \preceq a\}$ (4.2.29)

One may also consider the subset $\mathcal{L}_{\succeq a}$ of propositions implied by $a$

$\mathcal{L}_{\succeq a} = \{b \in \mathcal{L} : a \preceq b\} = (\mathcal{L}_{\preceq a'})'$ (4.2.30)

The question is: is this set of propositions $\mathcal{L}_{\preceq a}$ still an orthocomplemented lattice? One takes
as order relation $\preceq$, $\vee$ and $\wedge$ in $\mathcal{L}_{\preceq a}$ the same as in $\mathcal{L}$ and as trivial and empty propositions
$1_{\preceq a} = a$, $\emptyset_{\preceq a} = \emptyset$. Now, given a proposition $b \in \mathcal{L}_{\preceq a}$, one must define what is its complement
$b'_{\preceq a}$ in $\mathcal{L}_{\preceq a}$. A natural choice is

$b'_{\preceq a} = b' \wedge a$ (4.2.31)

but in general with such a choice $\mathcal{L}_{\preceq a}$ is not an orthocomplemented lattice, since it is easy to
find for general OC lattices counterexamples such that one may have $b \vee b'_{\preceq a} \neq a$.

Weak-modularity: In order for $\mathcal{L}_{\preceq a}$ to be an orthocomplemented lattice (for any $a \in \mathcal{L}$), the
orthocomplemented lattice $\mathcal{L}$ must satisfy the weak-modularity condition

$b \preceq a \implies (a \wedge b') \vee b = a$ (4.2.32)

This condition is also sufficient.
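
A brute-force check of condition (4.2.32) on two small examples may help. The sketch below assumes two hypothetical six-element orthocomplemented lattices on the same set of labels: one in which a, a', b, b' are mutually incomparable, and a "hexagon" in which a is below b (hence b' is below a'). Both examples and all function names are illustrative, not taken from these notes.

```python
def bounded_order(elements, extra=()):
    """Order relation with "0" below and "1" above everything, plus `extra` pairs."""
    order = {(x, x) for x in elements}
    order |= {("0", x) for x in elements} | {(x, "1") for x in elements}
    return order | set(extra)

def is_weakly_modular(elements, order, neg):
    """Check  small ≼ big  =>  (big ∧ small') ∨ small = big  for all pairs."""
    def leq(x, y):
        return (x, y) in order

    def meet(x, y):
        lower = [c for c in elements if leq(c, x) and leq(c, y)]
        return next(c for c in lower if all(leq(d, c) for d in lower))

    def join(x, y):
        upper = [c for c in elements if leq(x, c) and leq(y, c)]
        return next(c for c in upper if all(leq(c, d) for d in upper))

    return all(join(meet(big, neg[small]), small) == big
               for small in elements for big in elements if leq(small, big))

ELEMENTS = ["0", "a", "a'", "b", "b'", "1"]
NEG = {"0": "1", "1": "0", "a": "a'", "a'": "a", "b": "b'", "b'": "b"}

# Example 1: the four non-trivial propositions are mutually incomparable.
flat_order = bounded_order(ELEMENTS)
# Example 2 ("hexagon"): a ≼ b, and therefore b' ≼ a' by causal reversibility.
hexagon_order = bounded_order(ELEMENTS, extra=[("a", "b"), ("b'", "a'")])

flat_ok = is_weakly_modular(ELEMENTS, flat_order, NEG)
hexagon_ok = is_weakly_modular(ELEMENTS, hexagon_order, NEG)
print("flat example weakly modular:", flat_ok)
print("hexagon weakly modular     :", hexagon_ok)
```

In the hexagon, a is below b but (b ∧ a') ∨ a = a, which differs from b; so this lattice is orthocomplemented without being weakly modular, and the restriction to the propositions implying b does not inherit a consistent complement.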
4.2.4.c - Orthomodular lattices

Orthomodularity: An OC lattice which satisfies the weak-modularity condition is said to be
an orthomodular lattice (or OM lattice).^6 Clearly if $\mathcal{L}$ is OM, for any $a \in \mathcal{L}$, $\mathcal{L}_{\preceq a}$ is also OM, as
well as $\mathcal{L}_{\succeq a}$.

Equivalent definitions: Weak-modularity has several equivalent definitions. Here are two
interesting ones:

- $a \preceq b \implies$ $a$ and $b$ are compatible.

- the orthocomplementation $a \to a'$ is unique in $\mathcal{L}$.

Irreducibility: For such lattices one can also define the concept of irreducibility. We have seen
that two elements $a$ and $b$ of $\mathcal{L}$ are compatible (or commute) if they generate a Boolean lattice.
The center $\mathcal{C}$ of a lattice $\mathcal{L}$ is the set of $a \in \mathcal{L}$ which commute with all the elements of the lattice
$\mathcal{L}$. It is obviously a Boolean lattice. A lattice is irreducible if its center $\mathcal{C}$ is reduced to the trivial
lattice $\mathcal{C} = \{\emptyset, 1\}$.

4.2.4.d - Weak-modularity versus modularity

NB: The (somewhat awkward) denomination "weak-modularity" is historical. Following
Birkhoff and von Neumann the stronger "modularity" condition for lattices was first
considered. Modularity is defined as

$a \preceq b \implies (a \vee c) \wedge b = a \vee (c \wedge b)$ (4.2.33)

Modularity is equivalent to weak-modularity for finite depth lattices (as a particular case, the
set of projectors on a finite dimensional Hilbert space forms a modular lattice). But modularity
turned out to be inadequate for infinite depth lattices (corresponding to the general theory of
projectors in infinite dimensional Hilbert spaces). The theory of modular lattices has links with
some W*-algebras and the theory of "continuous geometries" (see e.g. [MvN60]).

4.2.5 Pure states and AC properties 

Orthomodular (OM) lattices are a good starting point to consider the constraints that we
expect for the set of ideal measurements on a physical system, and therefore to study how one
can represent its states. In fact one still needs two more assumptions, which seem technical,
but which are also very important (and quite natural from the point of view of quantum
information theory). They rely on the concept of atoms, or minimal propositions, which are the
analog for propositions of the concept of minimal projectors or of pure states in the algebraic
formalism.

4.2.5.a - Atoms

An element $a$ of an OM lattice is said to be an atom if

$b \preceq a$ and $b \neq a \implies b = \emptyset$ (4.2.34)

This means that $a$ is a minimal non-empty proposition; it is not possible to find another
proposition compatible with $a$ which allows one to obtain more information on the system than the
information obtained if $a$ is found to be TRUE.

6. In French: treillis orthomodulaire, in German: orthomodularer Verband.
Atoms are the analog of projectors on pure states in the standard quantum formalism (pure
propositions). Indeed, if the system is in some state $\psi$ before the measurement of $a$, and if $a$ is an
atom and is found to be true, the system will be in a pure state $\psi_a$ after the measurement.

4.2.5.b - Atomic lattices 

A lattice is said to be atomic if any non-trivial proposition $b \neq \emptyset$ in $\mathcal{L}$ is such that there is
at least one atom $a$ such that $a \preceq b$ (i.e. any proposition "contains" at least one minimal non-empty
proposition). For an atomic OM lattice one can show that any proposition $b$ is then the
union of its atoms (atomisticity).

4.2.5.c - Covering property

Finally one needs also the covering property. The formulation useful in the quantum
framework is to state that if $a$ is a proposition and $b$ an atom not in the complement $a'$ of $a$, then the
Sasaki projection of $b$ onto $a$, $\Phi_a(b) = a \wedge (b \vee a')$, is still an atom.

The original definition of the covering property for atomic lattices by Birkhoff is: for any
$b \in \mathcal{L}$ and any atom $a \in \mathcal{L}$ such that $a \wedge b = \emptyset$, $a \vee b$ covers $b$, i.e. there is no $c$ strictly between
$b$ and $a \vee b$, such that $b \prec c \prec a \vee b$.

This covering property is very important. It means that when reducing a system to a
subsystem by some constraint (projection onto $a$), one cannot get a non-minimal proposition out
of a minimal one. This would mean that one could get more information out of a subsystem
than from the greater system. In other words, if a system is in a pure state, performing a perfect
measurement can only map it onto another pure state. Perfect measurements cannot decrease
the information on the system.

The covering property is in fact also related to the superposition principle. Indeed, it implies that (for
irreducible lattices) for any two different atoms $a$ and $b$, there must be a third atom $c$ different from $a$
and $b$ such that $c \preceq a \vee b$. Thus, in the weakest possible sense (remember we have no addition) $c$ is a
superposition of $a$ and $b$.

An atomistic lattice with the covering property is said to be an AC lattice. As mentioned
before these properties can be formulated in terms of the properties of the set of states on the
lattice rather than in terms of the propositions. I shall not discuss this here.

4.3 The geometry of orthomodular AC lattices 

I have given one (possible and personal) presentation of the principles at the basis of the
quantum logic formalism. It took some time since I tried to explain both the mathematical
formalism and the underlying physical ideas. I now explain the main mathematical result:
the set of propositions (ideal measurements) on a quantum system, defined as an
orthomodular AC lattice, can be equivalently represented as the set of orthogonal projections on
some "generalized Hilbert space".

4.3.1 Prelude: the fundamental theorem of projective geometry 

The idea is to extend a classical and beautiful theorem of geometry, the Veblen-Young
theorem. Any abstract projective geometry can be realized as the geometry of the affine subspaces
of some left-module (the analog of vector space) on a division ring $K$ (a division ring is a field
that is not necessarily commutative). This result is known as the "coordinatization of projective
geometry". Classical references on geometry are the books by E. Artin, Geometric Algebra [Art57], R. Baer,
Linear algebra and projective geometry [Bae05], or Conway. More precisely, a geometry on a
linear space is simply defined by a set of points $X$, and a set of lines $\mathcal{L}$ of $X$ (simply a set of
subsets of $X$).



Theorem: If the geometry satisfies the following axioms: 

1. Any line contains at least 3 points, 

2. Two points lie in a unique line, 

3. A line meeting two sides of a triangle, not at a vertex of the triangle, meets the third side
also (Veblen's axiom),

4. There are at least 4 non-coplanar points (a plane is defined in the usual way from lines),

then the corresponding geometry is the geometry of the affine subspaces of a left module $M$ on
a division ring $K$ (a division ring is in general a non-commutative field).



Discussion: The theorem here is part of the Veblen-Young theorem, which encompasses the
case when the 4th axiom is not satisfied. The first two axioms define a line geometry
structure such that lines are uniquely defined by pairs of points, but with some superposition
principle. The third axiom is represented in Fig. 4.3.



Figure 4.3: Veblen's axiom 



The fourth one is necessary to exclude some special non-Desarguesian geometries. 

Let me note that the division ring $K$ (an associative algebra with an addition $+$, a
multiplication $\times$ and an inverse $x \to x^{-1}$) is constructed out of the symmetries of the geometry, i.e. of
the automorphisms, or applications $X \to X$, $\mathcal{L} \to \mathcal{L}$, etc. which preserve the geometry.
Without giving any details, let me illustrate the case of the standard real projective plane (where
$K = \mathbb{R}$). The field structure on $\mathbb{R}$ is obtained by identifying $\mathbb{R}$ with a projective line $\ell$ with three
points 0, 1 and $\infty$. The "coordinate" $x \in \mathbb{R}$ of a point $X \in \ell$ is identified with the cross-ratio
$(X, 1; 0, \infty)$. Fig. 4.4 depicts the geometrical construction of the addition $X + Y$
and of the multiplication $X \times Y$ of two points $X$ and $Y$ on a line $\ell$.
4.3.2 The projective geometry of orthomodular AC lattices
4.3.2.a - The coordinatization theorem 

Similar "coordinatization" theorems hold for the orthomodular AC lattices that have been
introduced in the previous section. The last axioms AC (atomicity and covering) play a similar
role as the axioms of abstract projective geometry, allowing one to define "points" (the atoms), lines,
etc., with properties similar to the first 3 axioms of linear spaces. The difference with projective
geometry is the existence of the orthocomplementation (the negation $\neg$) which allows one to define
an abstract notion of orthogonality $\perp$, and the specific property of weak-modularity (which
will allow one to define in a consistent way what are projections on closed subspaces).

Let me first state the main theorem 

Theorem: Let $\mathcal{L}$ be a complete irreducible orthocomplemented AC lattice with length > 3 (i.e.
at least 4 different levels of propositions $\emptyset \prec a \prec b \prec c \prec d \prec 1$). Then the "abstract"
lattice $\mathcal{L}$ can be represented as the lattice $\mathcal{L}(V)$ of the closed subspaces of a left-module^7 $V$ on
a division ring^8 $K$ with a Hermitian form $f$. The ring $K$, the module $V$ and the form $f$ have the
following properties:

- The division ring K has an involution * such that (xy)* = y*x* 

- The vector space V has a non degenerate Hermitian (i.e. sesquilinear) form / : V x V — > 

K 

a,b G V , f{a,b) = (a|b) G K , (a|b) = (b|a)* (4.3.1) 

- The Hermitian form / defines an orthogonal projection and associates to each linear sub- 
space M of V its orthogonal M . 

M 1 = {b G v : (b|a) =0 Va G M} (4.3.2) 

- The closed subspaces of V are the subspaces M such that (M^) 1 - = M. 

- The Hermitian form is orthomodular, i.e. for any closed subspace, M + M = V. 

- The OM structure {■<, A , ') on the lattice C is isomorphic to the standard lattice structure 
(C , n , _!_) (subspace of, intersection of, orthogonal complement of) over the space C(V) 
of closed linear subspaces of V. 

7. A module is the analog of a vector space, but on a ring instead of a (commutative) field.

8. A division ring is the analog of a field, but without commutativity.

- Moreover, $V$ and $K$ are such that there is some element $a$ of $V$ with "norm" unity, $f(a, a) = 1$ (where 1 is the unit element of $K$).

I do not give the proof. I refer to the physics literature ([BC81] chapter 21, [Pir64], [Pir76]) and to the original mathematical literature [BvN36], [MM71], [Var85].

Thus this theorem states that an OM AC lattice can be represented as the lattice of orthog- 
onal projections over the closed linear subspaces of some "generalized Hilbert space" with a 
quadratic form defined over some non-commutative field K. This is very suggestive of the fact 
that Hilbert spaces are not abstract and complicated mathematical objects (as still sometimes 
stated), but are the natural objects to describe and manipulate ideal measurements in quantum 
physics. In particular the underlying ring $K$ and the algebraic structure of the space $V$ come out naturally from the symmetries of the lattice of propositions $\mathcal{L}$.

4.3.2.b - Discussion: which division ring K? 

The important theorem discussed before is very suggestive, but is not sufficient to "derive" 
standard quantum mechanics. The main question is which division algebra K and which in- 
volution * and Hermitian form / are physically allowed? Can one construct physical theories 
based on other rings than the usual K = C (or R or H)? 

The world of division rings is very large! The simplest ones are the finite division rings, where Wedderburn's theorem implies that $K$ is commutative, hence a Galois field (the simplest being $\mathbb{F}_p = \mathbb{Z}/p\mathbb{Z}$, $p$ prime). Beyond $\mathbb{C}$, $\mathbb{R}$ and $\mathbb{H}$, more complicated ones are fields of rational functions $F(X)$, up to very large ones (like the surreal numbers...), which are still commutative, and then non-commutative rings.

However, the requirements that $K$ has an involution, and that $V$ has a non-degenerate Hermitian form, so that $\mathcal{L}(V)$ is an OM lattice, already put very stringent constraints on $K$. For instance, it is well known that finite fields like the $\mathbb{F}_p$ ($p$ prime) do not work. Indeed, it is easy to see that the lattice $\mathcal{L}(V)$ of the linear subspaces of the $n$-dimensional vector space $V = (\mathbb{F}_p)^n$ is not orthomodular and cannot be equipped with a non-degenerate quadratic form that makes it so! Check with $p = n = 3$! But still many more exotic division rings $K$ than the standard $\mathbb{R}$, $\mathbb{C}$ (and $\mathbb{H}$) are possible at that stage.
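The suggested check with $p = n = 3$ can be carried out by brute force. The following is a minimal sketch, assuming the standard "dot product" $\langle u|v\rangle = \sum_i u_i v_i \bmod 3$ as the candidate form (the function names are hypothetical and only this one form is tested). It exhibits a non-zero vector orthogonal to itself, so that the line $M$ it spans is contained in $M^\perp$ and $M + M^\perp \neq V$.

```python
from itertools import product

P, N = 3, 3                                   # the field F_3 and the space (F_3)^3
ALL_VECTORS = list(product(range(P), repeat=N))

def dot(u, v):
    """Standard symmetric bilinear form on (F_p)^n (an assumed choice of form)."""
    return sum(a * b for a, b in zip(u, v)) % P

def span(v):
    """The line generated by v, as a set of vectors."""
    return {tuple((k * x) % P for x in v) for k in range(P)}

def orthogonal(subspace):
    """All vectors orthogonal to every vector of the given subspace."""
    return {w for w in ALL_VECTORS if all(dot(w, u) == 0 for u in subspace)}

def subspace_sum(s1, s2):
    """The set of all sums u + w with u in s1 and w in s2."""
    return {tuple((a + b) % P for a, b in zip(u, w)) for u in s1 for w in s2}

# Non-zero vectors with zero "norm" (isotropic vectors)
isotropic = [v for v in ALL_VECTORS if any(v) and dot(v, v) == 0]

a = (1, 1, 1)                                 # 1 + 1 + 1 = 0 mod 3
M = span(a)
M_perp = orthogonal(M)
total = subspace_sum(M, M_perp)

print(len(isotropic), M <= M_perp, len(total), len(ALL_VECTORS))
```

Here $M^\perp$ has 9 elements and already contains $M$, so $M + M^\perp = M^\perp$ is a proper subspace of the 27-element space: the orthomodularity condition $M + M^\perp = V$ fails for this form.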

4.3.3 Towards Hilbert spaces 

There are several arguments that point towards the standard solution: $V$ is a Hilbert space over $\mathbb{R}$, $\mathbb{C}$ (or $\mathbb{H}$). However none is completely convincing mathematically, even if most would satisfy a physicist. Remember that real numbers are expected to occur in physical theory for two reasons. Firstly we are trying to compute probabilities $p$, which are real numbers. Secondly quantum physics must be compatible with the relativistic concept of space-time, where space (and time) is described by continuous real variables. Of course this is correct as long as one does not try to quantize gravity.

We have not discussed yet precisely the structure of the states $\varphi$, and which constraints they may enforce on the algebraic structure of propositions. Remember that it is the set of states $\mathcal{E}$ which allows one to discuss the partial order relation $\preceq$ on the set of propositions $\mathcal{L}$. Moreover states $\varphi$ assign probabilities $\varphi(a) \in [0, 1]$ to propositions $a$, with the constraint that if $a \perp b$, $\varphi(a \vee b) = \varphi(a) + \varphi(b)$. Moreover the propositions $a \in \mathcal{L}$ (projective measurements) define via the Sasaki orthogonal projections $\pi_a$ a set of transformations $\mathcal{L} \to \mathcal{L}$, which form a so-called Baer *-semigroup. At the same time, propositions $a \in \mathcal{L}$ define mappings $\varphi \to \varphi_a$ on the states. Since, as in the algebraic formalism, convex linear combinations of states are states, $\mathcal{E}$ generates a linear vector space $E$, and forms a convex subset $\mathcal{E} \subset E$. Thus there is more algebraic structure to discuss than what I explained up to now. I refer to [BC81], chapters 16-19, for more details. I shall come back to states when discussing Gleason's theorem in the next section.

Assuming some "natural" continuity or completeness conditions for the states leads to theorems stating that the division ring $K$ must contain the field of real numbers $\mathbb{R}$, hence is $\mathbb{R}$, $\mathbb{C}$ or $\mathbb{H}$, and that the involution $*$ is continuous, hence corresponds to the standard involution $x^* = x$, $x^* = \bar{x}$ or $x^* = x^\star$ (quaternionic conjugation) respectively. See [BC81], chapter 21.3.

Another argument comes from an important theorem in the theory of orthomodular lattices, 
which holds for lattices of projections in infinite dimensional modules. 



Soler's Theorem: (Soler 1995) Let $\mathcal{L} = \mathcal{L}_K(V)$ be an irreducible OM AC lattice of closed linear subspaces in a left-module $V$ over a division ring $K$, as discussed above. If there is an infinite family $\{v_i\}$ of orthonormal vectors in $V$ such that $\langle v_i|v_j\rangle = \delta_{ij}\, f$ with some $f \in K$, then the division ring $K$ can only be $\mathbb{R}$, $\mathbb{C}$ or $\mathbb{H}$.

The proof of this highly non-trivial theorem is given in [Sol95]. It is discussed in more detail in [Hol95].

The assumptions of the theorem state that there is an infinite set of mutually compatible atoms $\{a_i\}_{i \in I}$ in $\mathcal{L}$ (commuting, or causally independent elementary propositions $a_i$), and in addition that there is some particular symmetry between the generators $v_i \in V$ of the linear spaces (the lines or rays) of these propositions.

The first assumption is quite natural if we take into account space-time and locality in quantum physics. Let me consider the case where the physical space in which the system is defined is infinite (flat) space or some regular lattice, so that it can be separated into causally independent pieces $O_\alpha$ (labelled by $\alpha \in \Lambda$, some infinite lattice). See for instance Fig. 4.5. It is sufficient to have one single proposition $a_\alpha$ relative to each $O_\alpha$ only (for instance "there is one particle in $O_\alpha$") to build an infinite family of mutually orthogonal propositions $b_\alpha = a_\alpha \wedge \big(\bigwedge_{\beta \neq \alpha} \neg a_\beta\big)$ in $\mathcal{L}$. Out of the $b_\alpha$, thanks to the atomic property (A), we can extract an infinite family of orthogonal atoms $c_\alpha$.



Figure 4.5: A string of causal diamonds (in space-time) 



However this does not ensure the second assumption: the fact that the corresponding $v_i \in V$ are orthonormal. The group of space translations $T$ must act as a group of automorphisms on the lattice of linear subspaces $\mathcal{L} = \mathcal{L}_K(V)$ (a group of automorphisms on an OC lattice $\mathcal{L} = \mathcal{L}_K(V)$ is a group of transformations which preserve the OC lattice structure $(\preceq, \wedge, {}')$ or equivalently $(\subseteq, \cap, \perp)$). There must correspond an action (a representation) of the translation group $T$ on the vector space $V$, and on the underlying field $K$. If the action is trivial the conditions of Soler's theorem are fulfilled, but this is not ensured a priori. See for instance [GL12] for a recent discussion of symmetries in orthomodular geometries. However I am not aware of a counterexample where a non-standard orthomodular geometry (i.e. different from that of a Hilbert space on $\mathbb{C}$ (or $\mathbb{R}$)) carries a representation of a "physical" symmetry group such as the Poincare or the Galilean group of space-time transformations (representations of these groups should involve the field of real numbers $\mathbb{R}$ in some form).

From now on we assume that a quantum system may indeed be described by projectors in a real or complex Hilbert space.

One last remark. The coordinatization theorem depends crucially on the fact that the OM lattice $\mathcal{L}$ is atomic, hence contains minimal propositions (atoms). They are the analog of minimal projectors in the theory of operator algebras. Hence the formalism discussed here is expected to be valid mathematically to describe only type I von Neumann algebras. I shall not elaborate further.



4.4 Gleason's theorem and the Born rule 



4.4.1 States and probabilities 

In the presentation of the formalism we have not put emphasis on the concept of states, although states are central in the definition of the causality order relation $\preceq$ and of the orthocomplementation ${}'$. We recall that to each state $\varphi$ and to each proposition $a$ is associated the probability $\varphi(a)$ for $a$ to be found true on the state $\varphi$. In other words, states are probability measures on the set of propositions, compatible with the causal structure. As already mentioned, the lattice structure of propositions can be formulated from the properties of the states on $\mathcal{L}$.

At that stage we have almost derived the standard mathematical formulation of quantum mechanics. Propositions (yes-no observables) are represented by orthogonal projectors on a Hilbert space $\mathcal{H}$. Projectors on pure states correspond to projectors on one dimensional subspaces, or rays, of $\mathcal{H}$, so the concept of pure states is associated to the vectors of $\mathcal{H}$.

Nevertheless it remains to understand which are the consistent physical states, and what are the rules which determine the probabilities for a proposition $a$ to be true in a state $\varphi$, in particular in a pure state. We recall that the states are in fact characterized by these probability distributions $a \to \varphi(a)$ on $\mathcal{L}$. Thus states must form a convex set of functions $\mathcal{L} \to [0, 1]$ and by consistency with the OM structure of $\mathcal{L}$ they must satisfy four conditions. These conditions define "quantum probabilities".

Quantum probabilities:

(1) $\varphi(a) \in [0, 1]$ (4.4.1)

(2) $\varphi(\emptyset) = 0$ , $\varphi(1) = 1$ (4.4.2)

(3) $a \neq b \ \Rightarrow\ \exists\, \varphi$ such that $\varphi(a) \neq \varphi(b)$ (4.4.3)

(4) $a \perp b \ \Rightarrow\ \varphi(a \vee b) = \varphi(a) + \varphi(b)$ (4.4.4)

Conditions (1) and (2) are the usual normalization conditions for probabilities. Condition (3) means that observables are distinguishable by their probabilities. Condition (4) is simply the fact that if $a$ and $b$ are orthogonal, they generate a Boolean algebra, and the associated probabilities must satisfy the usual sum rule. These conditions imply in particular that for any state $\varphi$, $\varphi(\neg a) = 1 - \varphi(a)$, and that if $a \preceq b$, then $\varphi(a) \le \varphi(b)$, as we expect.

It remains to understand if and why all states $\varphi$ can be represented by density matrices $\rho_\varphi$, and the probabilities for propositions $a$ given by $\varphi(a) = \mathrm{tr}(\rho_\varphi P_a)$, where $P_a$ is the projector onto the linear subspace associated to the proposition $a$. This is a consequence of a very important theorem in operator algebras, Gleason's theorem [Gle57].
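The easy direction, namely that probabilities of the form $\mathrm{tr}(\rho P_a)$ do satisfy the normalization and sum rules (4.4.2) and (4.4.4), can be checked on a small example. The sketch below is only an illustration: the $3 \times 3$ real density matrix and the two orthogonal projectors are arbitrary choices, the helper names are hypothetical, and exact rational arithmetic is used so that the equalities hold without rounding.

```python
from fractions import Fraction as F

def matmul(A, B):
    """Product of two square matrices given as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

def add(A, B):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(A, B)]

def sub(A, B):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(A, B)]

def prob(rho, P):
    """Probability tr(rho P) of the proposition represented by the projector P."""
    return trace(matmul(rho, P))

h = F(1, 2)
IDENTITY = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
ZERO = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]

# An assumed example state: symmetric, positive, unit trace
rho = [[F(1, 2), F(1, 4), 0], [F(1, 4), F(1, 3), 0], [0, 0, F(1, 6)]]

# Projectors onto the orthogonal rays (1,1,0)/sqrt(2) and (1,-1,0)/sqrt(2)
P_a = [[h, h, 0], [h, h, 0], [0, 0, 0]]
P_b = [[h, -h, 0], [-h, h, 0], [0, 0, 0]]

P_a_or_b = add(P_a, P_b)          # a v b for orthogonal a and b
P_not_a = sub(IDENTITY, P_a)      # the negation of a

print(prob(rho, P_a), prob(rho, P_b), prob(rho, P_a_or_b), prob(rho, P_not_a))
```

The converse statement, that every function satisfying the conditions is of this form, is the non-trivial content of Gleason's theorem and is of course not established by such a check.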



4.4.2 Gleason's theorem 



It is easy to see that to obtain quantum probabilities that satisfy the conditions (4.4.1)-(4.4.4) it is sufficient to consider atomic propositions, i.e. projections onto 1 dimensional subspaces (rays) generated by vectors $\vec e = |e\rangle$ (pure states) of the Hilbert space $\mathcal{H}$. Indeed, using (4.4.4) the probabilities for general projectors can be reconstructed (by the usual sum rule) from the probabilities for projections on rays. Since there is no ambiguity, for a state $\varphi$ we denote the probability for the atomic proposition $e$, represented by the projection $P_e$ onto a vector $\vec e = |e\rangle \in \mathcal{H}$, as

$\varphi(\vec e) = \varphi(P_e)$ , $P_e = |e\rangle\langle e|$ = projector onto $\vec e$ (4.4.5)

The rules (4.4.1)-(4.4.4) reduce to the following conditions.

Quantum probabilities for projections on pure states: For any state $\varphi$, the function $\varphi(\vec e)$ considered as a function on the "unit sphere" of the rays over the Hilbert space $\mathcal{H}$ (the projective space) $S = \mathcal{H}^*/K^*$ must satisfy

(1) $\varphi(\vec e) = \varphi(\lambda \vec e)$ for any $\lambda \in K$ such that $|\lambda| = 1$ (4.4.6)

(2) $0 \le \varphi(\vec e) \le 1$ (4.4.7)

(3) For any complete orthonormal basis $\{\vec e_i\}$ of $\mathcal{H}$, one has $\sum_i \varphi(\vec e_i) = 1$ (4.4.8)

Gleason's theorem states the fundamental result that any such function is in one to one 
correspondence with a density matrix. 



Gleason's theorem:

If the Hilbert space $\mathcal{H}$ over $K = \mathbb{R}$ or $\mathbb{C}$ is such that

$\dim(\mathcal{H}) \ge 3$ (4.4.9)

then any function $\varphi$ over the unit rays of $\mathcal{H}$ that satisfies the three conditions (4.4.6)-(4.4.8) is of the form

$\varphi(\vec e) = (\vec e \cdot \rho_\varphi \cdot \vec e) = \langle e|\rho_\varphi|e\rangle$ (4.4.10)

where $\rho_\varphi$ is a positive quadratic form (a density matrix) over $\mathcal{H}$ with the expected properties for a density matrix

$\rho_\varphi = \rho_\varphi^\dagger$ , $\rho_\varphi \ge 0$ , $\mathrm{tr}(\rho_\varphi) = 1$ (4.4.11)

Reciprocally any such quadratic form defines a function $\varphi$ with the three properties (4.4.6)-(4.4.8).



Gleason's theorem is fundamental. As we shall discuss a bit later, it implies the Born rule. It is also very important when discussing (and excluding a very general and most natural class of) hidden variables theories. So let us discuss it a bit more, without going into the details of the proof.

4.4.3 Principle of the proof 

The theorem is remarkable since there are no conditions on the regularity or measurability of the function $\varphi$. In the original derivation by Gleason [Gle57] he considers real "frame functions" $f$ of weight $W$ over $\mathcal{H}^* = \mathcal{H} \setminus \{0\}$ such that

(1) $f(\vec e) = f(\lambda \vec e)$ for any $\lambda \neq 0 \in K$ (4.4.12)

(2) $f$ is bounded (4.4.13)

(3) For any complete orthonormal basis $\{\vec e_i\}$ of $\mathcal{H}$, $\sum_i f(\vec e_i) = W$ = constant (4.4.14)

and proves that such a function must be of the form (4.4.10)

$f(\vec e) = (\vec e \cdot Q \cdot \vec e)$ , $Q$ quadratic form such that $\mathrm{tr}(Q) = W$ (4.4.15)

It is easy to see that this is equivalent to the theorem as stated above, since one can add constants and rescale the functions $f$ to go from (4.4.12)-(4.4.14) to (4.4.6)-(4.4.8). The original proof proceeds in three steps.

1. Real Hilbert space, $\dim(\mathcal{H}) = 3$ and $f$ a continuous frame function $\Rightarrow$ the theorem.
This is the easiest part, involving some group theory. Any frame function $f$ is a real function on the unit two dimensional sphere $S_2$ and if continuous it is square summable and can be decomposed into spherical harmonics

$f(\vec n) = \sum_{l,m} f_{l,m}\, Y_l^m(\vec n)$ (4.4.16)

The theorem amounts to showing that if $f$ is a frame function of weight $W = 0$, then only the $l = 2$ components of this decomposition (4.4.16) are non zero. Some representation theory (for the SO(3) rotation group) is enough. Any orthonormal (oriented) basis $(\vec n_1, \vec n_2, \vec n_3)$ of $\mathbb{R}^3$ is obtained by applying a rotation $R$ to the basis $(\vec e_x, \vec e_y, \vec e_z)$. Thus one can write

$f(\vec n_1) + f(\vec n_2) + f(\vec n_3) = \sum_{l} \sum_{m,m'} f_{l,m}\, D^{(l)}_{m,m'}(R)\, V^{(l)}_{m'}$ (4.4.17)

with the $D^{(l)}_{m,m'}(R)$ the Wigner D matrix for the rotation $R$, and the $V^{(l)}_m$ the components of the vectors $\vec V^{(l)}$ in the spin $l$ representation of SO(3), with components

$\vec V^{(l)} = \{V^{(l)}_m\}$ , $V^{(l)}_m = Y_l^m(0, 0) + Y_l^m(\pi/2, 0) + Y_l^m(\pi/2, \pi/2)$ (4.4.18)

If $f$ is a frame function of weight $W = 0$, the l.h.s. of (4.4.17) is zero for any $R \in SO(3)$. This implies that for a given $l$, the coefficients $f_{l,m}$ must vanish if the vector $\vec V^{(l)} \neq 0$, but are free if $\vec V^{(l)} = 0$. An explicit calculation shows that indeed

$\vec V^{(l)} \neq 0$ if $l \neq 2$ , $\vec V^{(l)} = 0$ if $l = 2$ (4.4.19)

This establishes the theorem in case (1).

2. Real Hilbert space, $\dim(\mathcal{H}) = 3$ and $f$ any frame function $\Rightarrow$ $f$ continuous.
This is the most non-trivial part: assuming that the function is bounded, the constraint (4.4.14) is enough to imply that the function is continuous! It involves a clever use of spherical geometry and of the frame identity $\sum_{i=1,2,3} f(\vec e_i) = W$. The basic idea is to start from the fact that since $f$ is bounded, it has a lower bound $f_{min}$ which can be set to 0. Then for any $\epsilon > 0$, take a vector $\vec n_0$ on the sphere such that $|f(\vec n_0) - f_{min}| < \epsilon$. It is possible to show that there is a neighbourhood $\mathcal{O}$ of $\vec n_0$ such that $|f(\vec n_1) - f(\vec n_2)| < C \epsilon$ for any $\vec n_1$ and $\vec n_2 \in \mathcal{O}$, where $C$ is a universal constant. It follows that the function $f$ is continuous at its minimum! Then it is possible, using rotations, to show that the function $f$ is continuous at any point on the sphere.

3. Generalize to dim('H) > 3 and to complex Hilbert spaces. 

This last part is more standard and more algebraic. Any frame function $f(\vec n)$ defined on unit vectors $\vec n$ such that $\|\vec n\| = 1$ may be extended to a quadratic function over vectors

$f(\vec v) = \|\vec v\|^2\, f(\vec v / \|\vec v\|)$.



Francois David, 2012 



Lecture notes - November 27, 2012 






For a real Hilbert space with dimension $d > 3$, the points (1) and (2) imply that the restriction of a frame function $f(\vec n)$ to any 3 dimensional subspace is a quadratic form $f(\vec v) = (\vec v \cdot Q \cdot \vec v)$. A simple and classical theorem by Jordan and von Neumann shows that this is enough to define a global real quadratic form $Q$ on the whole Hilbert space $\mathcal{H}$ through the polarization identity $4\, (\vec x \cdot Q \cdot \vec y) = f(\vec x + \vec y) - f(\vec x - \vec y)$.

For complex Hilbert spaces, the derivation is a bit more subtle. One can first apply the already obtained results to the restriction of frame functions over real submanifolds of $\mathcal{H}$ (real submanifolds are real subspaces of $\mathcal{H}$ such that $(\vec x \cdot \vec y)$ is always real). One then extends the obtained real quadratic form over the real submanifolds to a complex quadratic form on $\mathcal{H}$.
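The selection rule used in step 1 (only the $l = 2$ harmonics give frame functions of weight 0) can be illustrated numerically on the $m = 0$ components, which are proportional to the Legendre polynomials $P_l(\cos\theta) = P_l(n_z)$. The sketch below is an illustration under these assumptions, with hypothetical helper names: it sums $P_2$ and $P_4$ over orthonormal frames of $\mathbb{R}^3$ and shows that the $P_2$ sum vanishes for every frame, while the $P_4$ sum depends on the frame and therefore cannot define a frame function.

```python
import math
import random

def legendre2(z):
    """Legendre polynomial P_2, proportional to Y_2^0 as a function of n_z."""
    return (3 * z * z - 1) / 2

def legendre4(z):
    """Legendre polynomial P_4, proportional to Y_4^0 as a function of n_z."""
    return (35 * z**4 - 30 * z * z + 3) / 8

def normalize(v):
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def random_frame(rng):
    """Random orthonormal basis of R^3 (Gram-Schmidt plus a cross product)."""
    n1 = normalize([rng.gauss(0, 1) for _ in range(3)])
    w = [rng.gauss(0, 1) for _ in range(3)]
    overlap = sum(a * b for a, b in zip(w, n1))
    n2 = normalize([a - overlap * b for a, b in zip(w, n1)])
    n3 = [n1[1] * n2[2] - n1[2] * n2[1],
          n1[2] * n2[0] - n1[0] * n2[2],
          n1[0] * n2[1] - n1[1] * n2[0]]
    return [n1, n2, n3]

def frame_sum(f, frame):
    """Sum of f(n_z) over the three vectors of an orthonormal frame."""
    return sum(f(n[2]) for n in frame)

STANDARD = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
TILTED = [normalize([1, 1, 1]), normalize([1, -1, 0]), normalize([1, 1, -2])]

rng = random.Random(0)
worst_l2 = max(abs(frame_sum(legendre2, random_frame(rng))) for _ in range(1000))

print("largest |sum of P_2| over random frames:", worst_l2)
print("sum of P_4, standard frame:", frame_sum(legendre4, STANDARD))
print("sum of P_4, tilted frame:  ", frame_sum(legendre4, TILTED))
```

The vanishing of the $P_2$ sum follows from $\sum_i (n_i)_z^2 = 1$ for any orthonormal frame; no such identity exists for the quartic terms of $P_4$.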

4.4.4 The Born rule 

The Born rule is a simple consequence of Gleason's theorem. Indeed, any state (in the general sense of statistical state) corresponds to a positive quadratic form (a density matrix) $\rho$ and, given a minimal atomic proposition, which corresponds to a projector $P = |a\rangle\langle a|$ onto the ray corresponding to a single vector (pure state) $|a\rangle$, the probability $p$ for $P$ of being true is

$p = \langle P \rangle = \mathrm{tr}(\rho P) = \langle a|\rho|a\rangle$ (4.4.20)

The space of states $\mathcal{E}$ is thus the space of (symmetric) positive density matrices with unit trace

space of states = $\mathcal{E} = \{\rho : \ \rho = \rho^\dagger ,\ \rho \ge 0 ,\ \mathrm{tr}(\rho) = 1\}$ (4.4.21)

It is a convex set. Its extremal points, which cannot be written as a convex combination of two different states, are the pure states of the system, and are the density matrices of rank one, i.e. the density matrices which are themselves projectors onto a vector $|\psi\rangle$ of the Hilbert space.

$\rho$ = pure state $\Rightarrow$ $\rho = |\psi\rangle\langle\psi|$ , $\|\psi\| = 1$ (4.4.22)

One thus derives the well known fact that the pure states are in one to one correspondence with the vectors (well... the rays) of the Hilbert space $\mathcal{H}$ that was first introduced from the basic observables of the theory, the elementary atomic propositions (the projectors $P$). Similarly, one recovers the simplest version of the Born rule: the probability to measure a pure state $|\varphi\rangle$ into another pure state $|\psi\rangle$ (to "project" $|\varphi\rangle$ onto $|\psi\rangle$) is the squared modulus of the scalar product

$p(\varphi \to \psi) = |\langle \varphi|\psi\rangle|^2$ (4.4.23)
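As a small numerical illustration, the two expressions of the probability, $\mathrm{tr}(\rho P)$ with $\rho = |\varphi\rangle\langle\varphi|$ and $P = |\psi\rangle\langle\psi|$, and $|\langle\varphi|\psi\rangle|^2$, can be compared directly. The two unit vectors of $\mathbb{C}^2$ below are arbitrary choices made for this sketch, and the helper names are hypothetical.

```python
import math

def inner(u, v):
    """Scalar product <u|v>, antilinear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def outer(u, v):
    """The operator |u><v| as a matrix."""
    return [[a * b.conjugate() for b in v] for a in u]

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

# Two assumed pure states of a two-level system (unit vectors)
phi = [complex(0.6, 0.0), complex(0.0, 0.8)]
psi = [complex(1 / math.sqrt(2), 0.0), complex(1 / math.sqrt(2), 0.0)]

rho = outer(phi, phi)        # density matrix of the pure state |phi>
P = outer(psi, psi)          # projector onto the ray of |psi>

p_trace = trace(matmul(rho, P)).real
p_overlap = abs(inner(phi, psi)) ** 2

print(p_trace, p_overlap)
```

For these two vectors both expressions give $1/2$ up to rounding errors.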

4.4.5 Physical observables 

One can easily reconstruct the set of all physical observables, and the whole algebra of observables $\mathcal{A}$ of the system. I present the line of the argument, without any attempt at mathematical rigor.

Any ideal physical measurement of some observable $O$ consists in fact in taking a family of mutually orthogonal propositions $a_i$, i.e. of commuting symmetric projectors $P_i$ on $\mathcal{H}$ such that

$P_i^2 = P_i$ , $P_i = P_i^\dagger$ , $P_i P_j = P_j P_i = 0$ if $i \neq j$ (4.4.24)

performing all the tests (the order is unimportant since the projectors commute) and assigning a real number $o_i$ to the result of the measurement (the value of the observable $O$) if $a_i$ is found true (this occurs for at most one $a_i$) and zero otherwise. In fact one should take an appropriate limit when the number of $a_i$ goes to infinity, but I shall not discuss these important points of mathematical consistency. If you think about it, this is true for any imaginable measurement (position, speed, spin, energy, etc.). The resulting physical observable $O$ is thus associated to the symmetric operator

$O = \sum_i o_i\, P_i$ (4.4.25)

This amounts to the spectral decomposition of symmetric operators in the theory of algebras of operators.

Consider a system in a general state given by the density matrix $\rho$. From the general rules of quantum probabilities, the probability to find the value $o_i$ for the measurement of the observable $O$ is simply the sum of the probabilities to find the system in an eigenstate of $O$ of eigenvalue $o_i$, that is

$p(O \to o_i) = \mathrm{tr}(P_i\, \rho)$ (4.4.26)

and for a pure state $|\varphi\rangle$ it is simply

$\langle \varphi|P_i|\varphi\rangle = |\langle \varphi|\varphi_i\rangle|^2$ , $|\varphi_i\rangle = \frac{1}{\|P_i|\varphi\rangle\|}\, P_i|\varphi\rangle$ (4.4.27)

Again the Born rule! The expectation value for the result of the measurement of $O$ in a pure state $\psi$ is obviously

$E[O; \psi] = \langle O \rangle_\psi = \sum_i o_i\, p(O \to o_i) = \sum_i o_i\, \langle \psi|P_i|\psi\rangle = \langle \psi|O|\psi\rangle$ (4.4.28)

This is the standard expression for expectation values of physical observables as diagonal matrix elements of the corresponding operators. Finally for general (mixed) states one has obviously

$E[O; \rho] = \langle O \rangle_\rho = \sum_i o_i\, p(O \to o_i) = \sum_i o_i\, \mathrm{tr}(P_i\, \rho) = \mathrm{tr}(O \rho)$ (4.4.29)
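The chain of equalities for the expectation value can be followed on a two-level example. The sketch below is only an illustration: the observable (eigenvalues $\pm 1$, projectors onto $(1, \pm 1)/\sqrt{2}$) and the density matrix are arbitrary choices, the helper names are hypothetical, and exact fractions are used so that the two ways of computing the mean agree exactly.

```python
from fractions import Fraction as F

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

def weighted_sum(terms):
    """Build O = sum_i o_i P_i from a list of (o_i, P_i) pairs."""
    n = len(terms[0][1])
    return [[sum(o * P[i][j] for o, P in terms) for j in range(n)] for i in range(n)]

h = F(1, 2)
P_plus = [[h, h], [h, h]]          # projector onto (1, 1)/sqrt(2)
P_minus = [[h, -h], [-h, h]]       # projector onto (1, -1)/sqrt(2)
spectrum = [(1, P_plus), (-1, P_minus)]

O = weighted_sum(spectrum)         # spectral decomposition, here [[0, 1], [1, 0]]

# An assumed mixed state: symmetric, positive, unit trace
rho = [[F(3, 4), F(1, 4)], [F(1, 4), F(1, 4)]]

probabilities = [trace(matmul(P, rho)) for _, P in spectrum]
mean_from_probabilities = sum(o * p for (o, _), p in zip(spectrum, probabilities))
mean_from_trace = trace(matmul(O, rho))

print(probabilities, mean_from_probabilities, mean_from_trace)
```

The probabilities sum to one because $P_+ + P_- = 1$, and the weighted mean of the outcomes reproduces $\mathrm{tr}(O\rho)$.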

We have seen that the pure states generate by convex combinations the convex set $\mathcal{E}$ of all (mixed) states $\varphi$ of the system. Similarly the symmetric operators $O = O^\dagger$ generate (by operator multiplication and linear combinations) a C*-algebra $\mathcal{A}$ of bounded operators $B(\mathcal{H})$ on the Hilbert space $\mathcal{H}$. States are normalized positive linear forms on $\mathcal{A}$ and we are back to the standard algebraic formulation of quantum physics. The physical observables generate an algebra of operators, hence an abstract algebra of observables, as assumed in the algebraic formalism. We refer to the section about the algebraic formalism for the arguments for preferring complex Hilbert spaces to real or quaternionic ones.

4.5 Discussion 

This was a sketchy and partial introduction to the quantum logic approach for the formulation of the principles of quantum mechanics. I hope to have shown its relation with the algebraic formulation. Like the algebraic formulation, it relies on the concepts of states and of observables. However the observables are limited to the physical subset of yes/no propositions, corresponding to ideal projective measurements, without assuming a priori some algebraic structure between non-compatible propositions (non-commuting observables in the algebraic framework). I explained how the minimal set of axioms on these propositions and their actions on states, used in the quantum logic approach, is related to the physical concepts of causality, reversibility and separability/locality. The canonical algebraic structure of quantum mechanics comes out from the symmetries of the "logical structure" of the lattice of propositions. The propositions corresponding to ideal projective measurements are realized as orthogonal projections on a (possibly generalized) Hilbert space. Probabilities/states are given by quadratic forms, and the Born rule follows from the logical structure of quantum probabilities through Gleason's theorem.

This kind of approach is of course not completely foolproof. We have seen that the issue of the possible division algebras $K$ is not completely settled. The strong assumptions of atomicity and covering are essential, but somewhat restrictive compared to the algebraic approach (type II and III von Neumann algebras). It is sometimes stated that it cannot treat properly the case of a system composed of two subsystems since there is no concept of "tensorial product" of two OM-AC lattices as there is for Hilbert spaces and operator algebras. Note however that one should in general always think about multipartite systems as parts of a bigger system, not the opposite! Even in the algebraic formulation it is not known in the infinite dimensional case if two commuting subalgebras $\mathcal{A}_1$ and $\mathcal{A}_2$ of a bigger algebra $\mathcal{A}$ always correspond to the decomposition of the Hilbert space $\mathcal{H}$ into a tensor product of two subspaces $\mathcal{H}_1$ and $\mathcal{H}_2$.



IPhT 2012 



Introduction to quantum formalism[s] 






Chapter 5 

Additional discussions 



5.1 Quantum information approaches 

Quantum information science has undergone enormous developments in the last 30 years. 
I do not treat this wide and fascinating field here, but shall only discuss briefly some relations 
with the question of formalism. Indeed information theory leads to new ways to consider and 
use quantum theory. This renewal is sometimes considered as a real change of paradigm. 

The interest in the relations between Information Theory and Quantum Physics started 
really in the 70's from several questions and results: 

- The relations and conflicts between General Relativity and Quantum Physics: the theoretical discovery of the Bekenstein-Hawking quantum entropy for black holes, the black hole evaporation (information) paradox, the more general Unruh effect and quantum thermodynamical aspects of gravity and of event horizons (with many recent developments in quantum gravity and string theories, such as "Holographic gravity", "Entropic Gravity", etc.).

- The general ongoing discussions on the various interpretations of the quantum formalism, the meaning of quantum measurement processes, and whether a quantum state represents the "reality", or some "element of reality" on a quantum system, or simply the observer's information on the quantum system.

- Of course the theoretical and experimental developments of quantum computing. See for instance [NC10]. It started from the realization that quantum entanglement and quantum correlations can be used as a resource for performing calculations and the transmission of information in a more efficient way than when using classical correlations with classical channels.

- This led for instance to the famous "It from Bit" idea (or aphorism) of J. A. Wheeler (see e.g. in [Zur90]) and others (see for instance the book by Deutsch [Deu97], or talks by Fuchs [Fuc01], [Fuc02]). Roughly speaking this amounts to reversing the famous statement of Landauer "Information is Physics" into "Physics is Information", and to stating that Information is the good starting point to understand the nature of the physical world and of the physical laws.

This point of view has been developed and advocated by several authors in the area of 
quantum gravity and quantum cosmology. Here I shall just mention some old or recent at- 
tempts to use this point of view to discuss the formalism of "standard" quantum physics, not 
taking into account the issues of quantum gravity. 

In the quantum information inspired approaches a basic concept is that of "device", or "operation", which represents the most general manipulation on a quantum system. In a very oversimplified presentation (footnote 1), such a device is a "black box" with both a quantum input system $A$ and a quantum output system $B$, and with a set $I$ of classical settings $i \in I$ and a set $O$ of classical responses $o \in O$. The input and output systems $A$ and $B$ may be different, and may be multipartite systems, e.g. may consist in collections of independent subsystems $A = \bigcup_\alpha A_\alpha$, $B = \bigcup_\beta B_\beta$.

This general concept of device encompasses the standard concepts of state and of effect. A state corresponds to the preparation of a quantum system $S$ in a definite state; there is no input, $A = \emptyset$, the setting $i$ specifies the state, there is no response, and $B = S$ is the system. An effect corresponds to a destructive measurement on a quantum system $S$; the input $A = S$ is the system, there is no output, $B = \emptyset$, there are no settings $i$, and the response set $O$ is the set of possible measurement outcomes $o$. This concept of device also contains the general concept of a quantum channel; then $A = B$ and there are no settings or responses. Probabilities $p(i|o)$ are associated to the combination of a state and an effect; this corresponds to the standard concept of probability of observing some outcome $o$ when making a measurement on a quantum state (labeled by $i$).

Figure 5.1: A general device, a state and an effect



Figure 5.2: Probabilities are associated to a couple state-effect 



General information processing quantum devices are constructed by building causal circuits out of these devices used as building blocks, thus constructing complicated apparatus out of simple ones. An information theoretic formalism is obtained by choosing axioms on the properties of such devices (states and effects) and operational rules to combine these devices and circuits and the associated probabilities, thus obtaining for instance what is called in [CDP11] an operational probabilistic theory. This kind of approach is usually considered for finite dimensional theories (which in the quantum case correspond to finite dimensional Hilbert spaces), both for mathematical convenience, and since this is the kind of system usually considered in quantum information science.

1. Slightly more general than in some presentations.

This approach leads to a pictorial formulation of quantum information processing. It shares similarities with the "quantum pictorialism" logic formalism, more based on category theory, and presented for instance in [Coe10].

It can also be viewed as an operational and informational extension of the convex set approach (developed notably by G. Ludwig, see [Lud85] and [Aul01], [BC81] for details). This last approach puts more emphasis on the concept of states than on observables in QM.

I shall not discuss these approaches in any detail. Let me just highlight amongst the most recent attempts those of Hardy [Har01], [Har11] and those of Chiribella, D'Ariano & Perinotti [CDP10], [CDP11] (see [Bru11] for a short presentation of this last formulation). See also [MM11]. In [CDP11] the standard complex Hilbert space formalism of QM is derived from 6 informational principles: Causality, Perfect Distinguishability, Ideal Compression, Local Distinguishability, Pure Conditioning, Purification principle.

The first 2 principles are not very different from the principles of other formulations (causality is defined in a standard sense, and distinguishability is related to the concept of differentiating states by measurements). The third one is related to the existence of reversible maximally efficient compression schemes for states. The fourth and the fifth are about the properties of bipartite states, for instance the possibility of performing local tomography and the effect of separate atomic measurements on such states. The last one, about "purification", distinguishes quantum mechanics from classical mechanics, and states that any mixed state of some system $S$ may be obtained from a pure state of a composite larger system $S + S'$. See [Bru11] for a discussion of the relation of this last purification principle with the discussions of the "cut" between the system measured and the measurement device done for instance by Heisenberg in [CB], but see of course the previous discussion by von Neumann in [vN32].
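In the standard Hilbert space formalism the purification statement is easy to illustrate. The sketch below is not the construction of [CDP11]; it is the textbook one, under the assumption that the mixed state of $S$ is given in diagonal form, $\rho_S = \mathrm{diag}(p_1, \dots, p_d)$, and with hypothetical function names. It builds the pure state $|\Psi\rangle = \sum_i \sqrt{p_i}\, |i\rangle_S |i\rangle_{S'}$ and checks that tracing out $S'$ gives back $\rho_S$.

```python
import math

def purify(probabilities):
    """Amplitudes c[i][j] of |Psi> = sum_i sqrt(p_i) |i>_S |i>_S'."""
    d = len(probabilities)
    return [[complex(math.sqrt(p)) if i == j else 0j for j in range(d)]
            for i, p in enumerate(probabilities)]

def reduced_state(c):
    """Partial trace over S': rho_S[i][k] = sum_j c[i][j] * conj(c[k][j])."""
    d = len(c)
    return [[sum(c[i][j] * c[k][j].conjugate() for j in range(len(c[0])))
             for k in range(d)] for i in range(d)]

def purity(rho):
    """tr(rho^2): equal to 1 for a pure state, smaller than 1 for a mixed one."""
    d = len(rho)
    return sum(rho[i][k] * rho[k][i] for i in range(d) for k in range(d)).real

p = [0.7, 0.3]                      # an assumed mixed state diag(0.7, 0.3) of S
psi = purify(p)                     # pure state of the composite system S + S'
norm_squared = sum(abs(a) ** 2 for row in psi for a in row)
rho_S = reduced_state(psi)

print(norm_squared, rho_S, purity(rho_S))
```

The composite state is normalized and pure by construction, while the reduced state has purity $0.7^2 + 0.3^2 = 0.58 < 1$.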

5.2 Quantum correlations 

The world of quantum correlations is richer, more subtle and more interesting than the 
world of classical ones. Most of the puzzling features and seeming paradoxes of quantum 
physics come from these correlations, and in particular from the phenomenon of entanglement. 
Entanglement is probably the distinctive feature of quantum mechanics, and is a consequence 
of the superposition principle when considering quantum states for composite systems. Here 
I discuss briefly some basic aspects. Entanglement describes the particular quantum corre- 
lations between two quantum systems which (for instance after some interactions) are in a 
non separable pure state, so that each of them considered separately, is not in a pure state 
any more. Without going into history, let me recall that the terminology "entanglement"
("Verschränkung") was introduced in the quantum context by E. Schrödinger in 1935 (when
discussing the famous EPR paper). However, the mathematical concept is older and goes back
to the modern formulation of quantum mechanics. For instance, some peculiar features of
entanglement and its consequences were already discussed around the 1930s, in relation with
the theory of quantum measurement, by Heisenberg, von Neumann, Mott, etc. Examples of
interesting entangled many-particle states are provided by the Slater determinant for
many-fermion states, by the famous Bethe ansatz for the ground state of the spin 1/2 chain, etc.

5.2.1 Entropic inequalities 

von Neumann entropy: The difference between classical and quantum correlations is already 
visible when considering the properties of the von Neumann entropy of states of composite 
systems. Remember that the von Neumann entropy of a mixed state of a system A, given by a
density matrix $\rho_A$, is given by

    S(\rho_A) = -\mathrm{tr}(\rho_A \log \rho_A)    (5.2.1)

In quantum statistical physics, the log is usually the natural logarithm

    \log = \log_e = \ln    (5.2.2)

while in quantum information, the log is taken to be the binary logarithm

    \log = \log_2    (5.2.3)

The entropy measures the amount of "lack of information" that we have on the state of the
system. But in quantum physics, at variance with classical physics, one must be very careful
about the meaning of "lack of information", since one cannot speak about the precise state of a
system before making measurements. So the entropy could (and should) rather be viewed as a
measure of the number of independent measurements we can make on the system before having
extracted all the information, i.e. the amount of information we can extract from the system.
It can also be shown that the entropy gives the maximum information capacity of a quantum
channel that we can build out of the system. See [NC10] for a good introduction to quantum
information, and in particular to entropy viewed from the information theory point of view.
When no ambiguity exists on the state $\rho_A$ of the system A, I shall use the notations

    S_A = S(A) = S(\rho_A)    (5.2.4)

The von Neumann entropy shares many properties of the classical entropy. It has the same
convexity properties (it is a concave function of the state)

    S[\lambda\rho + (1-\lambda)\rho'] \ge \lambda S[\rho] + (1-\lambda) S[\rho'] ,   0 \le \lambda \le 1    (5.2.5)

It is minimal, $S = 0$, for systems in a pure state, and maximal for systems in an equipartition
state, $S = \log(N)$ if $\rho = \frac{1}{N} \mathbf{1}_N$. It is extensive for systems in separate states.
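The two extreme cases just quoted can be checked on a single spin 1/2. The following is only a minimal sketch in plain Python: the helper names `qubit_eigenvalues` and `von_neumann_entropy` are illustrative choices, and the closed-form eigenvalue formula is specific to 2x2 Hermitian matrices.

```python
import math

def qubit_eigenvalues(rho):
    """Eigenvalues of a 2x2 Hermitian matrix [[a, b], [conj(b), d]] in closed form."""
    a, b, d = rho[0][0].real, rho[0][1], rho[1][1].real
    mean = (a + d) / 2.0
    radius = math.sqrt(((a - d) / 2.0) ** 2 + abs(b) ** 2)
    return [mean - radius, mean + radius]

def von_neumann_entropy(rho, base=2.0):
    """S(rho) = -tr(rho log rho), from the eigenvalues (with 0 log 0 taken as 0)."""
    total = 0.0
    for p in qubit_eigenvalues(rho):
        if p > 1e-15:
            total -= p * math.log(p, base)
    return total

pure = [[1.0, 0.0], [0.0, 0.0]]    # |0><0|, a pure state
plus = [[0.5, 0.5], [0.5, 0.5]]    # |+><+|, also pure although it is not diagonal
mixed = [[0.5, 0.0], [0.0, 0.5]]   # equipartition state (1/2) 1_2

for name, rho in (("pure", pure), ("plus", plus), ("mixed", mixed)):
    print(name, von_neumann_entropy(rho))
```

With the binary logarithm the equipartition state of one q-bit carries one bit of entropy; passing `base=math.e` gives the statistical physics convention (5.2.2) instead.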



Relative entropy: The relative entropy (of a state $\rho$ w.r.t. another state $\sigma$ for the same
system) is defined as in classical statistics (Kullback-Leibler entropy) as

    S(\rho\|\sigma) = \mathrm{tr}(\rho \log \rho) - \mathrm{tr}(\rho \log \sigma)    (5.2.6)

with the same convexity properties.

The differences with the classical entropy arise for composite systems. For such a system
AB, composed of two subsystems A and B, a general mixed state is given by a density matrix
$\rho_{AB}$ on $\mathcal{H}_{AB} = \mathcal{H}_A \otimes \mathcal{H}_B$. The reduced density matrices for A and B are

    \rho_A = \mathrm{tr}_B(\rho_{AB}) ,   \rho_B = \mathrm{tr}_A(\rho_{AB})    (5.2.7)

This corresponds to the notion of marginal distributions w.r.t. A and B of the general probability
distribution of states for AB in classical statistics. Now if one considers

    S(AB) = -\mathrm{tr}(\rho_{AB} \log \rho_{AB}) ,   S(A) = -\mathrm{tr}(\rho_A \log \rho_A) ,   S(B) = -\mathrm{tr}(\rho_B \log \rho_B)    (5.2.8)

one has the following definitions.

Conditional entropy: The conditional entropy S(A|B) (the entropy of A conditional to B in the
composite system AB) is

    S(A|B) = S(AB) - S(B)    (5.2.9)

The conditional entropy S(A|B) corresponds to the remaining uncertainty (lack of information)
on A if B is known.

Mutual information: The mutual information (shared by A and B in the composite system AB) is

    S(A:B) = S(A) + S(B) - S(AB)    (5.2.10)

Subadditivity: The entropy satisfies the general inequalities (triangular inequalities)

    |S(A) - S(B)| \le S(AB) \le S(A) + S(B)    (5.2.11)

The rightmost inequality $S(AB) \le S(A) + S(B)$ is already valid for classical systems, but the
leftmost is quantum. Indeed, for classical systems the classical entropy $H_{cl}$ satisfies the
much stronger lower bound

    \max(H_{cl}(A), H_{cl}(B)) \le H_{cl}(AB)    (5.2.12)

Subadditivity implies that if AB is in a pure entangled state, S(A) = S(B). It also implies
that the mutual information in a bipartite system is always positive

    S(A:B) \ge 0    (5.2.13)

In the classical case the conditional entropy is always positive, $H_{cl}(A|B) \ge 0$. In the
quantum case the conditional entropy may be negative, $S(A|B) < 0$, if the entanglement between A
and B is large enough. This is a crucial feature of quantum mechanics. If $S(A|B) < 0$, it means
that A and B share information resources (through entanglement) which get lost if one gets
information on B only (through a measurement on B for instance).
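A negative conditional entropy is easy to exhibit on two q-bits. The sketch below (plain Python, illustrative function names, entropies in bits) takes a pure state of AB, for which S(AB) = 0, computes the reduced density matrices by a partial trace, and evaluates (5.2.9) and (5.2.10). It is restricted to pure two-qubit states so that only 2x2 eigenvalues are needed.

```python
import math

def entropy_2x2(rho):
    """von Neumann entropy (in bits) of a 2x2 density matrix, via its two eigenvalues."""
    a, b, d = rho[0][0].real, rho[0][1], rho[1][1].real
    mean = (a + d) / 2.0
    radius = math.sqrt(((a - d) / 2.0) ** 2 + abs(b) ** 2)
    return -sum(p * math.log2(p) for p in (mean - radius, mean + radius) if p > 1e-15)

def reduced_states(psi):
    """Reduced density matrices (rho_A, rho_B) of a two-qubit pure state.

    psi[2*i + j] is the amplitude of |i>_A |j>_B.
    """
    amp = [[complex(psi[0]), complex(psi[1])], [complex(psi[2]), complex(psi[3])]]
    rho_a = [[sum(amp[i][j] * amp[k][j].conjugate() for j in range(2))
              for k in range(2)] for i in range(2)]
    rho_b = [[sum(amp[i][j] * amp[i][k].conjugate() for i in range(2))
              for k in range(2)] for j in range(2)]
    return rho_a, rho_b

def conditional_entropy_pure(psi):
    """S(A|B) = S(AB) - S(B), with S(AB) = 0 because AB is in a pure state."""
    _, rho_b = reduced_states(psi)
    return 0.0 - entropy_2x2(rho_b)

def mutual_information_pure(psi):
    """S(A:B) = S(A) + S(B) - S(AB), again with S(AB) = 0."""
    rho_a, rho_b = reduced_states(psi)
    return entropy_2x2(rho_a) + entropy_2x2(rho_b)

s = 1 / math.sqrt(2)
bell = [s, 0.0, 0.0, s]          # (|00> + |11>)/sqrt(2), maximally entangled
product = [1.0, 0.0, 0.0, 0.0]   # |00>, no entanglement

print(conditional_entropy_pure(bell), mutual_information_pure(bell))
print(conditional_entropy_pure(product), mutual_information_pure(product))
```

For the maximally entangled state the conditional entropy comes out close to -1 bit and the mutual information close to 2 bits, twice the largest value allowed for two classical bits; for the product state both vanish.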



Strong subadditivity: Let us consider a tripartite system ABC. The entropy satisfies another
very interesting inequality

    S(A) + S(B) \le S(AC) + S(BC)    (5.2.14)

It is equivalent to (this is the usual form)

    S(ABC) + S(C) \le S(AC) + S(BC)    (5.2.15)

Note that 5.2.14 is also true for the classical entropy, but then for simple reasons. In the quantum
case it is a non trivial inequality.

The strong subadditivity inequality implies the triangle inequality for tripartite systems

    S(AC) \le S(AB) + S(BC)    (5.2.16)



so the entropic inequalities can be represented graphically as in fig. 5.3.

Figure 5.3: Entropic inequalities: the length of the line "X" is the von Neumann entropy S(X).
The tetrahedron has to be "oblate": the sum AC+BC (fat red lines) is always $\ge$ the sum A+B
(fat blue lines).

The strong subadditivity inequality has important consequences for conditional entropy
and mutual information (see [NC10]). Consider a tripartite composite system ABC. It implies
for instance

    S(C|A) + S(C|B) \ge 0    (5.2.17)

and

    S(A|BC) \le S(A|B)    (5.2.18)

which means that conditioning A on an additional part of the external subsystem (here C inside
BC) increases the information we have on the system (here A). One has also for the mutual
information

    S(A:B) \le S(A:BC)    (5.2.19)

This means that discarding a part of a multipartite quantum system (here C) cannot increase the
mutual information (here between A and the rest of the system). This last inequality is very
important. It implies for instance that if one has a composite system AB, performing some quantum
operation on B without touching A cannot increase the mutual information between A and
the rest of the system.
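As a consistency check, the two consequences (5.2.18) and (5.2.19) can be recovered from the usual form (5.2.15) of strong subadditivity by simply expanding the definitions (5.2.9) and (5.2.10). The only extra step, which is my own bookkeeping and not taken from [NC10], is to exchange the names of the subsystems B and C in (5.2.15).

```latex
% (5.2.15) with the labels B and C exchanged:
S(ABC) + S(B) \le S(AB) + S(BC)

% Conditional entropy, using S(X|Y) = S(XY) - S(Y):
S(A|BC) - S(A|B) = \big[ S(ABC) - S(BC) \big] - \big[ S(AB) - S(B) \big]
                 = S(ABC) + S(B) - S(AB) - S(BC) \le 0

% Mutual information, using S(X:Y) = S(X) + S(Y) - S(XY):
S(A:BC) - S(A:B) = \big[ S(A) + S(BC) - S(ABC) \big] - \big[ S(A) + S(B) - S(AB) \big]
                 = S(AB) + S(BC) - S(ABC) - S(B) \ge 0
```

Both statements are therefore the same inequality read in two different ways.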

Let us mention other subadditivity inequalities for tri- or quadri-partite systems.

    S(AB|CD) \le S(A|C) + S(B|D)    (5.2.20)
    S(AB|C) \le S(A|C) + S(B|C)    (5.2.21)
    S(A|BC) \le S(A|B) + S(A|C)    (5.2.22)



5.2.2 Bipartite correlations: 

The specific properties of quantum correlations between two causally separated systems are
known to disagree with what one would expect from a "classical picture" of quantum theory,
where the quantum probabilistic features come just from some lack of knowledge of underlying
"elements of reality". I shall come back later to the very serious problems with the "hidden
variables" formulations of quantum mechanics. But let us already discuss some of the properties
of these quantum correlations in the simple case of a bipartite system.

I shall discuss briefly one famous and important result: the Tsirelson bound. The general
context is that of the discussion of non-locality issues and of Bell's [Bel64] and CHSH
inequalities [CHSH69] in bipartite systems. However, since these last inequalities are more
relevant when discussing hidden variables models, I postpone their discussion to the next
section 5.3. This presentation is standard and simply taken from [Lal12].



5.2.2.a - The Tsirelson Bound 

The two spin system: Consider a simple bipartite system consisting of two spins 1/2, or q-bits, 1
and 2. If two observers (Alice A and Bob B) make independent measurements of, respectively,
the value of the spin 1 along some direction $\mathbf{n}_1$ (a unit vector in 3D space) and of the spin 2
along $\mathbf{n}_2$, at each measurement they get results (with a correct normalization) +1 or -1. Now
let us compare the results of four experiments, depending on whether A chooses to measure the
spin 1 along a first direction $\mathbf{a}$ or a second direction $\mathbf{a}'$, and whether B chooses (independently)
to measure the spin 2 along a first direction $\mathbf{b}$ or a second direction $\mathbf{b}'$. Let us call the
corresponding observables A, A', B, B', and by extension the results of the corresponding
measurements in a single experiment A and A' for the first spin, B and B' for the second spin.

    \text{spin 1 along } \mathbf{a} \to A = \pm 1 ;   \text{spin 1 along } \mathbf{a}' \to A' = \pm 1    (5.2.23)

    \text{spin 2 along } \mathbf{b} \to B = \pm 1 ;   \text{spin 2 along } \mathbf{b}' \to B' = \pm 1    (5.2.24)

Now consider the following combination M of products of observables, hence of products
of results of experiments

    M = AB - AB' + A'B + A'B'    (5.2.25)

and consider the expectation value $\langle M \rangle_\psi$ of M for a given quantum state
$|\psi\rangle$ of the two spin system. In practice this means that we prepare the spins in the
state $|\psi\rangle$ and choose randomly (with equal probabilities) one of the four pairs of
observables; to test locality, A and B may be causally disconnected, and choose independently
(with equal probabilities) one of their own two observables, i.e. spin directions. Then they make
their measurements. The experiment is repeated a large number of times and the right average
combination M of the results of the measurements is calculated afterwards.

A simple explicit calculation shows the following inequality, known as the Tsirelson bound
[Cir80].

Tsirelson bound: For any state and any choice of orientations $\mathbf{a}$, $\mathbf{a}'$, $\mathbf{b}$ and $\mathbf{b}'$, one has

    |\langle M \rangle| \le 2\sqrt{2}    (5.2.26)

while, as discussed later, "classically", i.e. for theories where the correlations are described by
contextually-local hidden variables attached to each subsystem, one has the famous Bell-CHSH
bound

    \langle |M| \rangle_{\text{"classical"}} \le 2 .    (5.2.27)
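One standard way to organize the "simple explicit calculation" is sketched below. It is not necessarily the route followed in [Cir80]; it only uses $A^2 = A'^2 = B^2 = B'^2 = 1$, the fact that the observables of spin 1 commute with those of spin 2, and the operator norm $\|\cdot\|$, which is my notation.

```latex
% Group the terms of (5.2.25):
M = A\,(B - B') + A'\,(B + B')

% Square, using (B - B')^2 + (B + B')^2 = 4 and (B - B')(B + B') = -(B + B')(B - B') = [B, B']:
M^2 = 4 + A A' [B, B'] - A' A [B, B'] = 4 + [A, A']\,[B, B']

% Each commutator of operators of norm 1 has norm at most 2:
\| M^2 \| \le 4 + \| [A, A'] \| \, \| [B, B'] \| \le 4 + 2 \cdot 2 = 8

% Hence, for any state,
|\langle M \rangle| \le \| M \| = \sqrt{\| M^2 \|} \le 2\sqrt{2}
```

If all the observables commute, the commutator term vanishes and one is left with $M^2 = 4$, which is consistent with the bound 2 of the classical case.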
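The Tsirelson bound is saturated if the state $|\psi\rangle$ for the two spins is the singlet

    |\psi\rangle = |\text{singlet}\rangle = \frac{1}{\sqrt{2}} \left( |\uparrow\rangle \otimes |\downarrow\rangle - |\downarrow\rangle \otimes |\uparrow\rangle \right)    (5.2.28)

and the directions $\mathbf{a}$, $\mathbf{a}'$, $\mathbf{b}$ and $\mathbf{b}'$ are coplanar, and such that $\mathbf{a} \perp \mathbf{a}'$, $\mathbf{b} \perp \mathbf{b}'$, and the
angle between $\mathbf{a}$ and $\mathbf{b}$ is $\pi/4$, as depicted in fig. 5.4.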
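The saturation can be checked numerically. The sketch below is a plain Python illustration under explicit assumptions: the four directions are taken in the x-z plane and labelled by their angle, the basis ordering is psi[2*i + j] for spin 1 in state i and spin 2 in state j (0 = up, 1 = down), and the particular angles 0, pi/2, pi/4, 3pi/4 are one choice compatible with the geometry described above.

```python
import math

def spin(theta):
    """Spin observable along the direction at angle theta in the x-z plane,
    cos(theta) sigma_z + sin(theta) sigma_x, with eigenvalues +1 and -1."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s], [s, -c]]

def correlation(psi, op_a, op_b):
    """<psi| op_a (x) op_b |psi> for a real two-spin state psi[2*i + j]."""
    total = 0.0
    for i in range(2):
        for j in range(2):
            for k in range(2):
                for l in range(2):
                    total += psi[2 * i + j] * op_a[i][k] * op_b[j][l] * psi[2 * k + l]
    return total

def chsh(psi, a, a_prime, b, b_prime):
    """<M> with M = AB - AB' + A'B + A'B' as in (5.2.25); the arguments are angles."""
    A, Ap, B, Bp = spin(a), spin(a_prime), spin(b), spin(b_prime)
    return (correlation(psi, A, B) - correlation(psi, A, Bp)
            + correlation(psi, Ap, B) + correlation(psi, Ap, Bp))

r = 1 / math.sqrt(2)
singlet = [0.0, r, -r, 0.0]   # (|up,down> - |down,up>)/sqrt(2), cf. (5.2.28)

# a perpendicular to a', b perpendicular to b', angle pi/4 between a and b
value = chsh(singlet, 0.0, math.pi / 2, math.pi / 4, 3 * math.pi / 4)
print(abs(value), 2 * math.sqrt(2))
```

Only the modulus of the result matters for the bound; reversing b and b' flips the sign of the combination M.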



5.2.2.b - Popescu-Rohrlich boxes 

Beyond the Tsirelson bound? Interesting questions arise when one considers what could happen
if there are "super-strong correlations" between the two spins (or in general between two
subsystems) that violate the Tsirelson bound. Indeed, the only mathematical bound on M for
general correlations is obviously $|\langle M \rangle| \le 4$. Such hypothetical systems are
considered in the theory of quantum information and are called Popescu-Rohrlich boxes [PR94].
With the notations of the previously considered 2 spin system, PR-boxes consist in a collection
of probabilities P(A, B|a, b) for the outputs A and B of the two subsystems, the inputs or
settings a and b being fixed. The (a, b) correspond to the settings I and the (A, B) to the
outputs O of fig. 5.1 of the quantum information section. In our case we can take for the first
spin

    a = 1 \to \text{choose orientation } \mathbf{a} ,   a = -1 \to \text{choose orientation } \mathbf{a}'    (5.2.29)

Figure 5.4: Spin directions for saturating the Tsirelson bound and maximal violation of the
Bell-CHSH inequality



and for the second spin

    b = 1 \to \text{choose orientation } \mathbf{b} ,   b = -1 \to \text{choose orientation } \mathbf{b}'    (5.2.30)

The possible outputs are always A = ±1 and B = ±1.

Figure 5.5: a Popescu-Rohrlich box



The fact that the P(A, B|a, b) are probabilities means that

    0 \le P(A,B|a,b) \le 1 ,   \sum_{A,B} P(A,B|a,b) = 1 \text{ for } a, b \text{ fixed}    (5.2.31)

Non signalling: If the settings a and b and the outputs A and B are relative to two causally
separated parts of the system, corresponding to manipulations by two independent agents
(Alice and Bob), enforcing causality means that Bob cannot guess which setting (a or a') Alice
has chosen from his choice of setting (b or b') and his output B, without knowing Alice's
output A. The same holds for Alice with respect to Bob. This requirement is enforced by the
non-signaling conditions

    \sum_A P(A,B|a,b) = \sum_A P(A,B|a',b)    (5.2.32)
    \sum_B P(A,B|a,b) = \sum_B P(A,B|a,b')    (5.2.33)

A remarkable fact is that there are choices of probabilities which respect the non-signaling
condition (hence causality) but violate the Tsirelson bound and even saturate the absolute
bound $|\langle M \rangle| = 4$. Such hypothetical devices would allow one to use "super-strong
correlations" (also dubbed "super-quantum correlations") to manipulate and transmit information
in a more efficient way than quantum systems do (in the standard way of quantum information
protocols, by sharing some initially prepared bipartite quantum system and exchanging classical
information) [BBL+06] [vD05] [PPK+09]. However, besides these very intriguing features of
"trivial communication complexity", such devices are problematic. For instance it seems that no
interesting dynamics can be defined on such systems [GMCD10].
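One explicit choice of probabilities of this kind can be written down and checked by brute force. The sketch below is an illustration in plain Python; the sign convention in `target_sign` is my own choice, adapted to the combination (5.2.25), and the function names are hypothetical. The box outputs are individually random, but their product is fixed by the pair of settings.

```python
from itertools import product

SETTINGS = (+1, -1)   # +1: first orientation (a or b), -1: second one (a' or b')
OUTPUTS = (+1, -1)

def target_sign(a, b):
    """Value of the product A*B imposed for the settings (a, b)."""
    return -1 if (a, b) == (+1, -1) else +1

def pr_box(A, B, a, b):
    """P(A, B | a, b): each output is uniformly random, A*B is fixed by the settings."""
    return 0.5 if A * B == target_sign(a, b) else 0.0

def correlator(P, a, b):
    """<AB> for the settings (a, b)."""
    return sum(A * B * P(A, B, a, b) for A, B in product(OUTPUTS, OUTPUTS))

def chsh_value(P):
    """<M> = <AB> - <AB'> + <A'B> + <A'B'>, cf. (5.2.25)."""
    return (correlator(P, +1, +1) - correlator(P, +1, -1)
            + correlator(P, -1, +1) + correlator(P, -1, -1))

def is_normalized(P):
    """Check the normalization part of (5.2.31) for every pair of settings."""
    return all(abs(sum(P(A, B, a, b) for A, B in product(OUTPUTS, OUTPUTS)) - 1.0) < 1e-12
               for a, b in product(SETTINGS, SETTINGS))

def is_non_signalling(P):
    """Check (5.2.32)-(5.2.33): each marginal is independent of the remote setting."""
    for B, b in product(OUTPUTS, SETTINGS):
        marginals = [sum(P(A, B, a, b) for A in OUTPUTS) for a in SETTINGS]
        if max(marginals) - min(marginals) > 1e-12:
            return False
    for A, a in product(OUTPUTS, SETTINGS):
        marginals = [sum(P(A, B, a, b) for B in OUTPUTS) for b in SETTINGS]
        if max(marginals) - min(marginals) > 1e-12:
            return False
    return True

print(is_normalized(pr_box), is_non_signalling(pr_box), chsh_value(pr_box))
```

The same checker can be applied to any other table of probabilities, for instance to verify that a box where Bob's output copies Alice's setting fails the non-signalling test.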

5.3 The problems with hidden variables 

5.3.1 Hidden variables and "elements of reality" 

In this section I discuss briefly some features of quantum correlations which are important 
when discussing the possibility that the quantum probabilities may still have, to some extent, a 
"classical interpretation" by reflecting our ignorance of inaccessible "sub-quantum" degrees of 
freedom or "elements of reality" of quantum systems, which could behave in a more classical 
and deterministic way. In particular a question is: which general constraints on such degrees 
of freedom are enforced by quantum mechanics? 

This is the general idea of the "hidden variables" program and of the search for explicit
hidden variable models. These ideas go back to the birth of quantum mechanics, and were for
instance proposed by L. de Broglie in his first "pilot wave model", but they were abandoned
by most physicists after the 1927 Solvay Congress and the advances of the 1930s, before
experiencing some revival and setbacks in the 1960s, from the works by Bohm and de Broglie, and
the discussions about locality and Bell-like inequalities.

The basic idea is that when considering a quantum system S, its state could be described by
some (partially or totally) hidden variables $\mathfrak{v}$ in some space $\mathfrak{V}$, with some unknown statistics
and dynamics. Each $\mathfrak{v}$ may represent a (possibly infinite) collection of more fundamental
variables. But they are such that the outcome of a measurement operation of a physically
accessible observable A is determined by the hidden variable $\mathfrak{v}$.

    \text{measurement of } A \to \text{outcome } a = f(A, \mathfrak{v}) \text{ (a real number)}    (5.3.1)

Quantum indeterminism should come from our lack of knowledge on the exact state of the
hidden variables. In other words, the pure quantum states $|\psi\rangle$ of the system should
correspond to some classical probability distribution $p_\psi(\mathfrak{v})$ on $\mathfrak{V}$. Of course a measurement
operation could back react on the hidden variables $\mathfrak{v}$.

This is probably an oversimplified presentation of the idea, since there are several versions
and models. But for instance in the hidden variable model of de Broglie and Bohm (for a single
particle obeying the Schrödinger equation), the hidden variable is $\mathfrak{v} = (\psi, x)$, where
$\psi = \{\psi(y)\}$ is the whole "pilot" wave function, and x the position of the particle.

In its simplest version, one could try to consider hidden variables (elements of reality) that
are in one to one correspondence with the possible outcomes a of all the observables A of the
system, and in particular which obey the addition law

    C = A + B \implies c = a + b ,  \text{i.e. }  f(C, \mathfrak{v}) = f(A, \mathfrak{v}) + f(B, \mathfrak{v})    (5.3.2)

This possibility is already discussed by J. von Neumann in his 1932 book [vN32] [vN55], where
it is shown to be clearly inconsistent. Indeed if A and B do not commute, the possible outcomes
of C (the eigenvalues of the operator C) are not in general sums of outcomes of A and B (sums
of eigenvalues of A and B), since A and B do not have common eigenvectors. See [Bub10] for a
detailed discussion of the argument and of its historical significance.
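The simplest instance of this obstruction is a single spin 1/2 with A and B the spin components along two orthogonal axes. The sketch below is an illustration of my own choosing (it is not an example taken from [vN32]), written in plain Python with the closed-form eigenvalues of a real symmetric 2x2 matrix.

```python
import math

def eigenvalues_2x2(m):
    """Eigenvalues of a real symmetric 2x2 matrix [[a, b], [b, d]]."""
    a, b, d = m[0][0], m[0][1], m[1][1]
    mean = (a + d) / 2.0
    radius = math.sqrt(((a - d) / 2.0) ** 2 + b ** 2)
    return (mean - radius, mean + radius)

sigma_z = [[1.0, 0.0], [0.0, -1.0]]   # observable A, outcomes +1 or -1
sigma_x = [[0.0, 1.0], [1.0, 0.0]]    # observable B, outcomes +1 or -1
total = [[sigma_z[i][j] + sigma_x[i][j] for j in range(2)] for i in range(2)]

outcomes_c = eigenvalues_2x2(total)   # possible outcomes of C = A + B
sums = {a + b for a in eigenvalues_2x2(sigma_z) for b in eigenvalues_2x2(sigma_x)}

print(outcomes_c)     # the two eigenvalues of sigma_z + sigma_x
print(sorted(sums))   # all possible sums a + b of individual outcomes
```

The outcomes of C are plus or minus the square root of 2, while the sums a + b can only be -2, 0 or 2, so no assignment obeying (5.3.2) can reproduce the spectrum of C.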



5.3.2 Context free hidden variables ? 

Hidden variable models have been rediscussed a few decades later, from a more realistic
point of view, in particular by J. Bell. In a modern language, the models considered are
"context free" or "non contextual" hidden variables models. The idea is that one should consider
only the correlations between results of measurements on a given system for sets of commuting
observables. Indeed only such measurements can be performed independently and in any
possible order (on a single realization of the system), and without changing the statistics of the
outcomes. Any such given set of observables can be thought of as a set of classical observables,
but of course this classical picture is not consistent from one set to another.

Thus the idea is still that a hidden variable assigns to any observable A an outcome
$a = f(A, \mathfrak{v})$ as in 5.3.1. This assumption is often called "value definiteness" (VD).

However the very strong constraint 5.3.2 should be replaced by the more realistic constraint
for the set of outcomes $\{f(A, \mathfrak{v})\}$

    \text{if } A \text{ and } B \text{ commute, then }
    \begin{cases}
      f(A + B, \mathfrak{v}) = f(A, \mathfrak{v}) + f(B, \mathfrak{v}) \\
      \text{and} \\
      f(AB, \mathfrak{v}) = f(A, \mathfrak{v}) \, f(B, \mathfrak{v})
    \end{cases}    (5.3.3)

Moreover, these conditions are extended to any family $\mathcal{F} = \{A_i, i = 1, 2, \cdots\}$ of commuting
operators.

Here I consider purely deterministic HV. This means that the assignment $A \to a = f(A, \mathfrak{v})$
is unique, and thus in QM a is one of the eigenvalues of the operator A.

The term "context free" means that the outcome a for the measurement of the first observable
A is supposed to be independent of the choice of the second observable B. In other words,
the outcome of a measurement depends on the hidden variable, but not on the "context" of the
measurement, that is on the other quantities measured at the same time.

We shall discuss the possibility that a is a random variable (with a law fixed by $\mathfrak{v}$) later.



5.3.3 Gleason's theorem and contextuality 

This kind of model seems much more realistic. However, such models are immediately excluded
by Gleason's theorem [Gle57], as already argued by J. Bell in [Bel66].
Indeed, if to any $\mathfrak{v}$ is associated a function $f_\mathfrak{v}$, defined as

    f_\mathfrak{v} : A \to f_\mathfrak{v}(A) = f(A, \mathfrak{v})    (5.3.4)

which satisfies the consistency conditions 5.3.3, this is true in particular for any family of
commuting projectors $\{P_i\}$, whose outcome is 0 or 1

    P \text{ projector such that } P = P^\dagger = P^2 \implies f_\mathfrak{v}(P) = 0 \text{ or } 1    (5.3.5)

In particular, this is true for the family of projectors $\{P_i\}$ onto the vectors of any orthonormal
basis $\{\vec e_i\}$ of the Hilbert space $\mathcal{H}$ of the system. This means simply that defining the function f
on the unit vectors $\vec e$ by

    f(\vec e) = f_\mathfrak{v}(P_{\vec e}) ,   P_{\vec e} = |\vec e\rangle\langle\vec e|    (5.3.6)

(remember that $\mathfrak{v}$ is considered fixed), this function must satisfy for any orthonormal basis

    \{\vec e_i\} \text{ orthonormal basis} \implies \sum_i f(\vec e_i) = 1    (5.3.7)

while we have for any unit vector

    f(\vec e) = 0 \text{ or } 1    (5.3.8)

This contradicts strongly Gleason's theorem (see 4.4.2), as soon as the Hilbert space of the
system $\mathcal{H}$ has dimension $\dim(\mathcal{H}) \ge 3$! Indeed, 5.3.7 means that the function f is a frame function
(in the sense of Gleason), hence is continuous, while 5.3.8 (following from the fact that f is a
function on the projectors) means that f cannot be a continuous function. So

    \dim(\mathcal{H}) \ge 3 \implies \text{no context-free HV can describe all the quantum correlations}    (5.3.9)

Gleason's theorem is a very serious blow to the HV idea. However, some remaining possibili- 
ties can be considered, for instance: 

1. There are still context-free HV, but they describe only some specific subset of the quantum 
correlations, not all of them. 

2. There are HV, but they are fully contextual. 

We now discuss two famous cases where the first option has been explored, but appears to be
still problematic. The second one also raises big questions, which will be briefly discussed later.



5.3.4 The Kochen-Specker theorem 

The first option is related to the idea that some subset of the correlations of a quantum
system have a special status, being related to some special explicit "elements of reality" (the
"be-ables" in the terminology of J. Bell), by contrast to the ordinary observables which are just
"observ-ables". Thus a question is whether for a given quantum system there are finite families
of non commuting observables which can be associated to context-free HV.

In fact the problems with non-contextual HV have been shown to arise already for very
small such subsets of observables, first by S. Kochen and E. Specker [KS67]. These issues started
to be discussed by J. Bell in [Bel66]. This is the content of the Kochen-Specker theorem. This
theorem provides in fact examples of finite families of unit vectors $\mathcal{E} = \{\vec e_i\}$ in a Hilbert space
$\mathcal{H}$ (over R or C) of finite dimension ($\dim(\mathcal{H}) = n$), such that it is impossible to find any frame
function such that

    f(\vec e_i) = 0 \text{ or } 1   \text{and}   (\vec e_{i_1}, \cdots, \vec e_{i_n}) \text{ orthonormal basis} \implies \sum_{a=1}^{n} f(\vec e_{i_a}) = 1    (5.3.10)

The original example of [KS67] involved a set with 117 projectors in a 3 dimensional Hilbert
space and is a very nice example of non-trivial geometry calculation. Simpler examples in
dimension n = 3 and n = 4 with fewer projectors have been provided by several authors (Mermin,
Cabello, Peres, Penrose).

I do not discuss more these examples and their significance. But this shows that the
contextual character of quantum correlations is a fundamental feature of quantum mechanics.
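The Kochen-Specker vector families are too large to reproduce in a few lines. As a substitute, the sketch below uses the well-known Peres-Mermin square for two spins 1/2 (dimension 4). This is an assumption of convenience: it is phrased with nine observables of eigenvalues +1 and -1 rather than with the families of unit vectors of (5.3.10), and the particular arrangement of the square is the one I recall from the literature. The code first checks the operator products along rows and columns, then searches exhaustively for an assignment of outcomes obeying the product rule of (5.3.3).

```python
from itertools import product

I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Y = [[0, -1j], [1j, 0]]
Z = [[1, 0], [0, -1]]

def kron(a, b):
    """Tensor product of two 2x2 matrices, as a 4x4 nested list."""
    return [[a[i][k] * b[j][l] for k in range(2) for l in range(2)]
            for i in range(2) for j in range(2)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def sign_of_product(ops):
    """Return s = +1 or -1 if the product of the matrices equals s times the identity."""
    prod = ops[0]
    for op in ops[1:]:
        prod = matmul(prod, op)
    s = complex(prod[0][0])
    for i in range(4):
        for j in range(4):
            if abs(prod[i][j] - (s if i == j else 0)) > 1e-12:
                return None
    return int(round(s.real)) if abs(s.imag) < 1e-12 else None

# Each row and each column contains three mutually commuting observables.
SQUARE = [
    [kron(X, I2), kron(I2, X), kron(X, X)],
    [kron(I2, Z), kron(Z, I2), kron(Z, Z)],
    [kron(X, Z), kron(Z, X), kron(Y, Y)],
]
row_signs = [sign_of_product(row) for row in SQUARE]
col_signs = [sign_of_product([SQUARE[r][c] for r in range(3)]) for c in range(3)]

def count_noncontextual_assignments(rows, cols):
    """Number of +1/-1 assignments to the 9 entries whose row and column products
    match the operator products, as the product rule of (5.3.3) would require."""
    count = 0
    for values in product((+1, -1), repeat=9):
        grid = [values[0:3], values[3:6], values[6:9]]
        rows_ok = all(grid[r][0] * grid[r][1] * grid[r][2] == rows[r] for r in range(3))
        cols_ok = all(grid[0][c] * grid[1][c] * grid[2][c] == cols[c] for c in range(3))
        if rows_ok and cols_ok:
            count += 1
    return count

print(row_signs, col_signs, count_noncontextual_assignments(row_signs, col_signs))
```

The search finds no assignment, for a simple parity reason: the product of the three row constraints and the product of the three column constraints both equal the product of all nine values, but the operator identities give them opposite signs.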
5.3.5 Bell / CHSH inequalities and non-locality

Another important situation where non-contextuality is explored, in relation with locality,
is found in the famous 1964 paper by J. Bell [Bel64]. Consider a bipartite system S consisting of
two causally separated subsystems $S_1$ and $S_2$, for instance a pair of space-like separated photons
in a Bell-like experiment. One is interested in the correlations between the measurements that
are performed independently on $S_1$ and $S_2$. Any pair of corresponding observables A and B (or
more exactly $A \otimes 1$ and $1 \otimes B$) commute, and thus one expects that the result of a measurement
on $S_1$, if it depends on some HV, should not depend on the measurement made on $S_2$. In other
words, the result of a measurement on $S_1$ should not depend on the context of $S_2$. The reciprocal
statement is true as well.

Thus, following Bell, let us assume that some HV's underlie the bipartite system S, and
that they are local in the sense that they are $S_1$-versus-$S_2$ context free. But they may not - and
in fact they cannot - be context-free with respect to $S_1$ or $S_2$ only. This means that a given HV
$\mathfrak{v}$ should determine separately the relation observable A → outcome a for $S_1$ and
observable B → outcome b for $S_2$. In other words, such a hidden variable assigns a pair of
probability distributions for all the observables relative to $S_1$ and $S_2$

    \mathfrak{v} \mapsto ( p_1(a|A) , p_2(b|B) )    (5.3.11)

The function $p_1(a|A)$ gives the probability for the outcome a when measuring A on $S_1$, the
function $p_2(b|B)$ the probability for the outcome b when measuring B on $S_2$.

One may assume that these probabilities can be decomposed into subprobabilities associated
to local hidden variables $\mathfrak{w}_1$ and $\mathfrak{w}_2$ for the two subsystems $S_1$ and $S_2$. In this case $\mathfrak{v}$ is
itself a pair of probability distributions $(q_1, q_2)$ over the $\mathfrak{w}_1$'s and $\mathfrak{w}_2$'s respectively.

    \mathfrak{v} = (q_1, q_2) ,   q_1 : \mathfrak{w}_1 \to q_1(\mathfrak{w}_1) ,   q_2 : \mathfrak{w}_2 \to q_2(\mathfrak{w}_2)    (5.3.12)

while it is the HV $\mathfrak{w}_1$ (respectively $\mathfrak{w}_2$) that determines the outcome A → a (respectively B → b).
These HV's have to be contextual if one wants the relations A → a and B → b to be consistent
with quantum mechanics for the two subsystems.

But one may also take the probability distributions $p_1(a|A)$ and $p_2(b|B)$ to be fully quantum
mechanical, thus corresponding, using Gleason's theorem, to some density matrices $\rho_1$ and $\rho_2$

    \mathfrak{v} = (\rho_1, \rho_2)    (5.3.13)

such that $p_1(a|A) = \mathrm{tr}(\delta(a - A)\rho_1)$ and $p_2(b|B) = \mathrm{tr}(\delta(b - B)\rho_2)$.

In any case, hidden variables of the form 5.3.11 are denoted "local hidden variables". One
might perhaps rather call them "locally-contextual-only hidden variables" but let us keep the
standard denomination.

A quantum state $\psi$ of S corresponds to some probability distribution $q(\mathfrak{v})$ over the HV's $\mathfrak{v}$.
$q(\mathfrak{v})$ represents our ignorance about the "elements of reality" of the system. If this description
is correct, the probability for the pair of outcomes (A, B) → (a, b) in the state $\psi$ is given by the
famous representation

    p(a,b|A,B) = \sum_{\mathfrak{v}} q(\mathfrak{v}) \, p_1(a|A) \, p_2(b|B)    (5.3.14)

It is this peculiar form which implies the famous Bell and CHSH inequalities on the correlations
between observables on the two causally independent subsystems. Let us repeat the argument
for the CHSH inequality. If we consider as observables for $S_1$ (respectively $S_2$) two (not
necessarily commuting) projectors $P_1$ and $P_1'$ (respectively $Q_2$ and $Q_2'$), with outcome 0 or 1, and
redefine them as

    A = 2P_1 - 1 ,   A' = 2P_1' - 1 ,   B = 2Q_2 - 1 ,   B' = 2Q_2' - 1    (5.3.15)

so that the outcomes are -1 or 1, and if one performs a series of experiments on an ensemble of
independently prepared instances of the bipartite system S, choosing randomly with equal
probabilities to measure (A, B), (A', B), (A, B') or (A', B'), and combines the results to compute
the average

    \langle M \rangle = \langle AB \rangle - \langle AB' \rangle + \langle A'B \rangle + \langle A'B' \rangle    (5.3.16)

the same argument as in 5.2.2, using the general inequality

    a, a', b, b' \in [-1, 1] \implies a(b - b') + a'(b + b') \in [-2, 2]    (5.3.17)

implies the CHSH inequality

    -2 \le \langle M \rangle \le 2    (5.3.18)
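The elementary inequality (5.3.17) can be checked by enumeration at the extreme points, where each local outcome is +1 or -1. This is a minimal sketch in plain Python; the names `chsh_combination` and `chsh_average` are illustrative, and the dictionary `q` plays the role of the distribution over hidden variables in (5.3.14) restricted to deterministic local assignments.

```python
from itertools import product

def chsh_combination(a, a_prime, b, b_prime):
    """a*b - a*b' + a'*b + a'*b' = a(b - b') + a'(b + b'), cf. (5.3.16)-(5.3.17)."""
    return a * (b - b_prime) + a_prime * (b + b_prime)

# The 16 deterministic local assignments (a, a', b, b') with values +1 or -1
ASSIGNMENTS = list(product((+1, -1), repeat=4))
deterministic_values = {chsh_combination(*v) for v in ASSIGNMENTS}

def chsh_average(q):
    """<M> for a probability distribution q over assignments (dict: assignment -> weight)."""
    return sum(weight * chsh_combination(*v) for v, weight in q.items())

uniform = {v: 1.0 / len(ASSIGNMENTS) for v in ASSIGNMENTS}
print(sorted(deterministic_values), chsh_average(uniform))
```

Every deterministic assignment gives exactly +2 or -2, because one of the two brackets (b - b') and (b + b') always vanishes. Since the combination is linear in each variable, values inside [-1, 1] and any statistical mixture can only give averages between these two extremes.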

This inequality is known to be violated for some quantum states (entangled states) and some
choices of observables. Indeed $\langle M \rangle$ may saturate the Tsirelson bound $|\langle M \rangle| \le 2\sqrt{2}$. The reason
is simple. Assuming that all quantum states give probabilities of the form 5.3.14 and that the
probabilities $p_1(a|A)$ and $p_2(b|B)$ obey the quantum rules and are representable by density
matrices means that any quantum state (mixed or pure) $\psi$ can be represented by a density
matrix of the form

    \rho = \sum_{\mathfrak{v}} q(\mathfrak{v}) \, \rho_1(\mathfrak{v}) \otimes \rho_2(\mathfrak{v})    (5.3.19)

Such states are called separable states. But not all states are separable. For a bipartite system,
this is indeed the case for pure entangled states, which are not of this form.

I do not discuss the many and very interesting generalizations and variants of Bell
inequalities (for instance the spectacular GHZ example for tripartite systems) and the possible
consequences and tests of non-contextuality.

I do not review either all the experimental tests of violations of Bell-like inequalities in
various contexts, starting from the first experiments by Clauser, and those by Aspect et al., up
to the most recent ones. They are in full agreement with the predictions of standard Quantum
Mechanics and more precisely of Quantum Electro Dynamics. See for instance [Lal12] for a
recent and very complete review.

5.3.6 Discussion 

The significance and consequences of the Bell and CHSH inequalities and of the Kochen-Specker
theorem have been enormously discussed, and some debates are still going on. To review and
summarize these discussions is not the purpose of these notes. Let me just try to make some
simple remarks.

The assumption of context-free value definiteness is clearly not tenable, from Gleason's
theorem. This means that one must be very careful when discussing quantum physics about
correlations between results of measurements. To quote a famous statement by Peres:
"Unperformed experiments have no results" [Per78].

Trying to assign some special ontological status to a (finite and in practice small) number
of observables to avoid the consequences of the Kochen-Specker theorem may be envisioned,
but raises other problems. For instance, if one wants to keep the main axioms of QM, and
non-contextuality, by using a finite number of observables, one would expect that the quantum
logic formalism would lead to QM on a finite division ring (a Galois field), but it is known
that this is not possible (see the discussion in 4.3.2). Note however that relaxing some basic
physical assumptions like reversibility and unitarity has been considered, for instance in [tH07].

It is also clear that non-local quantum correlations are present in non-separable quantum
states, highlighted by the violations of Bell's and CHSH-like inequalities (and their numerous
and interesting variants). They represent some of the most non-classical and counter-intuitive
features of quantum physics. In connection with the discussions of the "EPR-paradox", this
non-local aspect of quantum physics has been often - and is still sometimes - presented as
a contradiction between the principles of quantum mechanics and those of special relativity.
This is of course not the case. These issues must be discussed in the framework of relativistic
quantum field theory where the basic objects are quantum fields, not (first quantized) particles
(or classical fields). See section 3.8.1. In this formalism a quantum state of a field is
(some kind of) wave function over field configurations over the entire space, and is intrinsically
a non-local and non-separable object. The physical requirements of causality and locality,
implying no faster-than-light signaling (or any kind of real "spooky action at a distance"), are
requirements on the observables, i.e. on the self-adjoint operators of the theory.



Finally, option (2) at the end of 5.3.3 - there are hidden variables, but they are fully
contextual - is also very problematic and raises more questions than solutions (in my opinion).
For instance, I would expect that even assuming non-contextual value definiteness, the sum and
product relations 5.3.3 should still hold for commuting observables with fully non-degenerate
spectrum. Then a problem of definiteness arises when considering a projector as a limit of
such observables (in some sense the result of a measurement should depend not only on all
the measurements you can perform, but on those you will not perform). Another problem is
that contextuality leads one to consider that there are non-local hidden correlations between the
system and the measurement apparatus before any measurement, which in some sense pushes the
problem one step further without really solving it. Nevertheless, contextuality has been
considered by several authors in connection with some interpretations of quantum mechanics like
the so-called "modal interpretations". I am however unable to discuss this further.



To summarize the discussions of these last two sections 5.2 and 5.3: Contrary to classical
physics, there is an irreducible quantum uncertainty in the description of any quantum system.
Not all its physical observables can be characterized at the same time. This is of course the
uncertainty principle. Contrary to a simple reasoning, this does not mean that a quantum
system is always more uncertain or "fuzzy" than a classical system. Indeed, the quantum
correlations are stronger than the classical correlations, as exemplified by the quantum entropic
inequalities 5.2.11 and 5.2.14 and the Tsirelson bound 5.2.26, compared to their classical analogs,
the entropic bound 5.2.12 and the B-CHSH inequality 5.3.18. This can be represented by the
little drawing of Fig. 5.6. This is why the results by J. Bell and the subsequent ones turned out to
have a long term impact. They contributed to the realization of what is not quantum mechanics,
and to the rise of quantum information: using quantum correlations and entanglement, it is
possible to transmit and manipulate information, perform calculations, etc. in ways which are
impossible by classical means, and which are much more efficient.



5.4 Measurements 

5.4.1 What are the questions? 

Up to now I have not discussed much the question of quantum measurements. I simply
took the standard point of view that (at least in principle) ideal projective measurements are
feasible and that one should look at the properties of the outcomes. The question is of course
much more complex. In this section I just recall some basic points about quantum measurements.

The meaning of the measurement operations is at the core of quantum physics. It was
considered as such from the very beginning. See for instance the proceedings of the famous
Solvay 1927 Congress [BV12], and the 1983 review by Wheeler and Zurek [WZ83]. Many great
minds have thought about the so-called "measurement problem", and the domain has been
revived in the last decades by experimental progress, which now allows one to manipulate
simple quantum systems and to implement ideal measurements effectively.

Figure 5.6: Schematic of the worlds of classical correlations, quantum correlations and
"super-strong" unphysical correlations

On one hand, quantum measurements represent one of the most puzzling features of quan- 
tum physics. They are non-deterministic processes (quantum mechanics predicts only prob- 
abilities for outcomes of measurements). They are irreversible processes (the phenomenon of 
the "wave-function collapse"). They reveal the irreducible uncertainty of quantum physics (the 
uncertainty relations). This makes quantum measurements very different from "ideal classical
measurements".

On the other hand, quantum theory is the first physical theory that addresses seriously the
problem of the interactions between the observed system (the object) and the measurement
apparatus (the observer). Indeed in classical physics the observer is considered as a spectator,
able to register the state of the real world (hence to have its own state modified by the
observation), but without perturbing the observed system in any way. Quantum physics shows that
this assumption is not tenable. Moreover, it seems to provide a logically satisfying answer^2 to
the basic question: what are the minimal constraints put on the results of physical measurements
by the basic physical principles?^3

It is often stated that the main problem about quantum measurement is the problem of the
uniqueness of the outcome. For instance, why do we observe a spin 1/2 (i.e. a q-bit) in the
state $|\uparrow\rangle$ or in the state $|\downarrow\rangle$ when we start in a superposition
$|\psi\rangle = \alpha|\uparrow\rangle + \beta|\downarrow\rangle$? However, by definition a
measurement is a process which gives one single classical outcome (out of several possible ones).
Thus in my opinion the real questions, related to the question of the "projection postulate",
are: (1) Why should repeated ideal measurements always give the same answer? (2) Why is it not
possible to "measure" the full quantum state $|\psi\rangle$ of a q-bit by a single measurement
operation, but only its projection onto some reference frame axis?

Again, the discussion that follows is very sketchy and superficial. A good recent reference,
covering the history of the "quantum measurement problem", a detailed study of explicit
dynamical models for quantum measurements, and a complete bibliography, is the research and
review article [ABN12].



2. Even if it does not satisfy every mind, every time...

3. Well... as long as gravity is not taken into account!

5.4.2 The von Neumann paradigm

The general framework to discuss quantum measurements in the context of quantum theory
was provided by J. von Neumann in his 1932 book [vN32, vN55]. Let me present it on the
simple example of the q-bit.

But first, let me stress that this discussion will not provide a derivation of the principles
of quantum mechanics (existence of projective measurements, probabilistic features and Born
rule), but rather a self-consistency argument of compatibility between the axioms of QM about
measurements and what QM predicts about measurement devices.

An ideal measurement involves the interaction between the quantum system $S$ (here a q-bit)
and a measurement apparatus $\mathcal{M}$, which is a macroscopic object. The idea is that
$\mathcal{M}$ must be treated as a quantum object, like $S$. An ideal non-destructive measurement
on $S$ that does not change the orthogonal states $|\uparrow\rangle$ and $|\downarrow\rangle$ of
$S$ (thus corresponding to a measurement of the spin along the $z$ axis, $S_z$) corresponds to
introducing for a finite (short) time an interaction between $S$ and $\mathcal{M}$, and to
starting from a well chosen initial state $|I\rangle$ for $\mathcal{M}$. The interaction and
the dynamics of $\mathcal{M}$ must be such that, if one starts from an initial separable state
where $S$ is in a superposition state

$$|\psi\rangle = \alpha|\uparrow\rangle + \beta|\downarrow\rangle \qquad (5.4.1)$$

then after the measurement (interaction) the whole system (object + apparatus) is in an
entangled state

$$|\psi\rangle\otimes|I\rangle \;\to\; \alpha|\uparrow\rangle\otimes|F_+\rangle + \beta|\downarrow\rangle\otimes|F_-\rangle \qquad (5.4.2)$$

The crucial point is that the final states $|F_+\rangle$ and $|F_-\rangle$ for $\mathcal{M}$
must be orthogonal^4

$$\langle F_+|F_-\rangle = 0 \qquad (5.4.3)$$

Of course this particular evolution of the initial state (5.4.1) is unitary for any choice of
$|\psi\rangle$, since it transforms a pure state into a pure state:

$$|\psi\rangle\otimes|I\rangle \;\to\; \alpha|\uparrow\rangle\otimes|F_+\rangle + \beta|\downarrow\rangle\otimes|F_-\rangle \qquad (5.4.4)$$

One can argue that this is sufficient to show that the process has all the characteristics
expected from an ideal measurement, within the quantum formalism itself. Indeed, using
the Born rule, this is consistent with the fact that the state $|\uparrow\rangle$ is observed
with probability $p_+ = |\alpha|^2$ and the state $|\downarrow\rangle$ with probability
$p_- = |\beta|^2$. Indeed the reduced density matrices, both for the system $S$ and for the
system $\mathcal{M}$ (projected onto the two pointer states), are those of a completely mixed
(diagonal) state

$$\rho_S = \begin{pmatrix} p_+ & 0 \\ 0 & p_- \end{pmatrix} \qquad (5.4.5)$$
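The step from the entangled state (5.4.4) to a diagonal reduced density matrix can be checked on a toy example. The sketch below is only an illustration under assumptions of mine: the pointer states live in a 3-dimensional space, the values of `alpha` and `beta` are arbitrary, and the function names are hypothetical. It traces out the apparatus and compares orthogonal pointer states with deliberately non-orthogonal ones.

```python
import math

def entangled_state(alpha, beta, f_plus, f_minus):
    """Amplitudes psi[s][m] of alpha|up>|F+> + beta|down>|F->."""
    return [[alpha * c for c in f_plus], [beta * c for c in f_minus]]

def reduced_density_matrix(psi):
    """Trace out the apparatus: rho[s][t] = sum_m psi[s][m] * conj(psi[t][m])."""
    return [[sum(a * b.conjugate() for a, b in zip(psi[s], psi[t]))
             for t in range(2)] for s in range(2)]

alpha, beta = 0.6, 0.8j          # assumed amplitudes, |alpha|^2 + |beta|^2 = 1
f_plus = [1.0, 0.0, 0.0]         # toy pointer state |F+>
f_minus = [0.0, 1.0, 0.0]        # toy pointer state |F->, orthogonal to |F+>
rho_orth = reduced_density_matrix(entangled_state(alpha, beta, f_plus, f_minus))

# Non-orthogonal pointer states leave off-diagonal coherences in rho_S.
f_tilt = [math.sqrt(0.5), math.sqrt(0.5), 0.0]
rho_tilt = reduced_density_matrix(entangled_state(alpha, beta, f_plus, f_tilt))
```

With orthogonal pointer states the off-diagonal entries of `rho_orth` vanish and the diagonal carries the Born probabilities. With the tilted pointer state the coherence is proportional to the overlap of the two pointer states, which is why the orthogonality condition (5.4.3) is the crucial one.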

For instance, as discussed in [vN32, vN55], if one is in the situation where the observer
$\mathcal{O}$ really observes the measurement apparatus $\mathcal{M}$, not the system $S$
directly, the argument can be repeated as

$$|\psi\rangle\otimes|I\rangle\otimes|O\rangle \;\to\; \alpha|\uparrow\rangle\otimes|F_+\rangle\otimes|O_+\rangle + \beta|\downarrow\rangle\otimes|F_-\rangle\otimes|O_-\rangle \qquad (5.4.6)$$

and it does not matter if one puts the fiducial separation between object and observer between
$S$ and $\mathcal{M}+\mathcal{O}$ or between $S+\mathcal{M}$ and $\mathcal{O}$. This argument
can be repeated ad infinitum.

A related argument is that once a measurement has been performed, if we repeat it using
for instance another copy $\mathcal{M}'$ of the measurement apparatus, after the second
measurement we obtain

$$|\psi\rangle\otimes|I\rangle\otimes|I'\rangle \;\to\; \alpha|\uparrow\rangle\otimes|F_+\rangle\otimes|F'_+\rangle + \beta|\downarrow\rangle\otimes|F_-\rangle\otimes|F'_-\rangle \qquad (5.4.7)$$

so that we never observe both $|\uparrow\rangle$ and $|\downarrow\rangle$ in a successive series
of measurements (hence the measurement is really a projective measurement). The argument also
holds if the outcome of the first measurement is stored on some classical memory device
$\mathcal{D}$ and the measurement apparatus is reinitialized to $|I\rangle$. This kind of
argument can already be found in [Mot29].

4. As already pointed out in [vN32].

The discussion here is clearly outrageously oversimplified and very sketchy. For a precise
discussion, one must distinguish, among the degrees of freedom of the measurement apparatus
$\mathcal{M}$, the (often still macroscopic) variables which really register the state of the
observed system, the so-called pointer states, from the other (numerous) microscopic degrees of
freedom of $\mathcal{M}$, which are present anyway since $\mathcal{M}$ is a macroscopic object,
and which are required both to ensure decoherence (see next section) and to induce dissipation,
so that the pointer states become stable and store in an efficient way the information about the
result of the measurement. One must also take into account the coupling of the system $S$ and of
the measurement apparatus $\mathcal{M}$ to the environment $\mathcal{E}$.



5.4.3 Decoherence and ergodicity (mixing) 

As already emphasized, the crucial point is that starting from the same initial state
$|I\rangle$, the possible final pointer states for the measurement apparatus, $|F_+\rangle$ and
$|F_-\rangle$, are orthogonal. This is now a well defined dynamical problem, which can be studied
using the theory of quantum dynamics for closed and open systems. The fact that $\mathcal{M}$ is
macroscopic, i.e. that its Hilbert space of states is very large, is essential, and the crucial
concept is decoherence (in a general sense).

The precise concept and denomination of quantum decoherence was introduced in the
1970s (especially by Zeh) and developed and popularized in the 1980s (see the reviews [JZK+03],
[Zur03]). But the basic idea seems much older, and for our purpose one can probably go back
to the end of the 1920s and to von Neumann's quantum ergodic theorem [vN29] (see [vN10] for
the English translation and [GLM+10] for historical and physical perspective).

One starts from the simple geometrical remark [vN29] that if $|e_1\rangle$ and $|e_2\rangle$
are two random unit vectors in an $N$-dimensional Hilbert space $\mathcal{H}$ (real or complex),
their average "overlap" (squared scalar product) is of order

$$\overline{|\langle e_1|e_2\rangle|^2} \simeq \frac{1}{N}, \qquad N = \dim(\mathcal{H}) \qquad (5.4.8)$$

hence it is very small, and for all practical purposes equal to 0, if $N$ is very large. Remember
that for a quantum system made out of $M$ similar subsystems, $N \propto (N_0)^M$, $N_0$ being
the number of accessible quantum states for each subsystem.
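The $1/N$ scaling of the average overlap is easy to check numerically. The following is a minimal Monte Carlo sketch, assuming complex vectors drawn uniformly on the unit sphere (obtained here by normalizing complex Gaussian vectors); the function names, the sample size and the chosen dimensions are my own choices for illustration.

```python
import random

def random_unit_vector(n, rng):
    """Complex Gaussian vector normalized to unit length (uniform on the sphere)."""
    v = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]
    norm = sum(abs(c) ** 2 for c in v) ** 0.5
    return [c / norm for c in v]

def mean_overlap(n, samples, rng):
    """Monte Carlo average of |<e1|e2>|^2 over pairs of random unit vectors."""
    total = 0.0
    for _ in range(samples):
        e1, e2 = random_unit_vector(n, rng), random_unit_vector(n, rng)
        total += abs(sum(a.conjugate() * b for a, b in zip(e1, e2))) ** 2
    return total / samples

rng = random.Random(0)  # fixed seed so that the run is reproducible
results = {n: mean_overlap(n, 2000, rng) for n in (2, 8, 32)}
```

For each dimension `n` the product `n * results[n]` should come out close to 1, up to statistical fluctuations, which is the content of (5.4.8).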



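A simple idealized model to obtain a dynamics of the form (5.4.4) for $S+\mathcal{M}$ is to
assume that both $S$ and $\mathcal{M}$ have no intrinsic dynamics and that the evolution during
the interaction/measurement time interval is given by an interaction Hamiltonian (acting on the
Hilbert space $\mathcal{H} = \mathcal{H}_S\otimes\mathcal{H}_\mathcal{M}$ of $S+\mathcal{M}$)
of the form

$$H_{\rm int} = |\uparrow\rangle\langle\uparrow|\otimes H_+ + |\downarrow\rangle\langle\downarrow|\otimes H_- \qquad (5.4.9)$$

where $H_+$ and $H_-$ are two different Hamiltonians (operators) acting on
$\mathcal{H}_\mathcal{M}$. It is clear that if the interaction between $S$ and $\mathcal{M}$
takes place during a finite time $t$, and is then switched off, the final state of the system is
an entangled one of the form (5.4.4) with

$$|F_+\rangle = e^{\frac{t}{i\hbar}H_+}|I\rangle , \qquad |F_-\rangle = e^{\frac{t}{i\hbar}H_-}|I\rangle \qquad (5.4.10)$$

so that

$$\langle F_+|F_-\rangle = \langle I|\,e^{-\frac{t}{i\hbar}H_+}\, e^{\frac{t}{i\hbar}H_-}\,|I\rangle \qquad (5.4.11)$$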






It is quite easy to see that if $H_+$ and $H_-$ are not (too much) correlated (in a sense that
I do not make more precise), the final states $|F_+\rangle$ and $|F_-\rangle$ are quite
uncorrelated with respect to each other and to the initial state $|I\rangle$ after a very short
time, and may be considered as random states in $\mathcal{H}_\mathcal{M}$, so that

$$|\langle F_+|F_-\rangle|^2 \simeq \frac{1}{\dim(\mathcal{H}_\mathcal{M})} \ll 1 \qquad (5.4.12)$$

so that for all practical purposes, we may assume that

$$\langle F_+|F_-\rangle = 0 \qquad (5.4.13)$$

This is the basis of the general phenomenon of decoherence. The interaction between the
observed system and the measurement apparatus has induced a decoherence between the states
$|\uparrow\rangle$ and $|\downarrow\rangle$ of $S$, but also a decoherence between the pointer
states $|F_+\rangle$ and $|F_-\rangle$ of $\mathcal{M}$.

Moreover, the larger $\dim(\mathcal{H}_\mathcal{M})$ is, the smaller is the "decoherence time"
beyond which $\langle F_+|F_-\rangle \simeq 0$ (and it is often in practice too small to be
observable), and the larger (in practice infinitely larger) is the "quantum Poincaré recurrence
time" (where one might expect to get again $|\langle F_+|F_-\rangle| \simeq 1$).
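To see the overlap (5.4.11) drop from 1 to a value of order $1/\dim(\mathcal{H}_\mathcal{M})$, one can look at a deliberately crude toy model. The assumptions here are mine and much stronger than in the text: $H_+$ and $H_-$ are taken diagonal in a common basis (so they commute), their eigenvalues are independent Gaussian random numbers, the initial state $|I\rangle$ has equal weight on every basis vector, and $\hbar = 1$. In that case (5.4.11) reduces to an average of random phases.

```python
import cmath
import random

def pointer_overlap(t, e_plus, e_minus, hbar=1.0):
    """<F+|F-> for H+ and H- diagonal in one basis and |I> uniform over it."""
    n = len(e_plus)
    return sum(cmath.exp(1j * t * (ep - em) / hbar)
               for ep, em in zip(e_plus, e_minus)) / n

rng = random.Random(1)   # fixed seed for reproducibility
n = 1024                 # assumed dimension of the apparatus Hilbert space
e_plus = [rng.gauss(0, 1) for _ in range(n)]    # toy spectrum of H+
e_minus = [rng.gauss(0, 1) for _ in range(n)]   # toy spectrum of H-

overlap_start = abs(pointer_overlap(0.0, e_plus, e_minus)) ** 2
overlap_late = abs(pointer_overlap(50.0, e_plus, e_minus)) ** 2
```

At `t = 0` the two pointer states coincide, while at late times the squared overlap fluctuates around `1 / n`. This commuting model only illustrates the late-time level of the overlap; it does not capture the dependence of the decoherence time on the dimension, for which generic non-commuting Hamiltonians are needed.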

Of course, as already mentioned, this is just the first step in the discussion of the dynamics
of a quantum measurement. One has in particular to check and to explain how, and under
which conditions, the pointer states are quantum microstates which correspond to macroscopic
classical-like macrostates, which can be manipulated, observed and stored in an efficient way.
At that stage, I just paraphrase J. von Neumann (in the famous chapter VI "Der Meßprozeß" of
[vN32])

"Die weitere Frage (...) soll uns dagegen nicht beschäftigen."
("The further question (...), on the other hand, shall not concern us.")

Decoherence is a typical quantum phenomenon. It explains how, in most situations and 
systems, quantum correlations in small (or big) multipartite systems are "washed out" and 
disappear through the interaction of the system with other systems, with its environment or its 
microscopic internal degrees of freedom. Standard references on decoherence and the general
problem of the quantum to classical transition are [Zur90] and [Sch07].

However, the underlying mechanism for decoherence has a well known classical analog: it is
the (quite generic) phenomenon of ergodicity, or more precisely the mixing property of classical
dynamical systems. I refer to textbooks such as [AA68] and [LL92] for precise mathematical
definitions, proofs and details. Again I give here an oversimplified presentation.

Let us consider a classical Hamiltonian system. One considers its dynamics on (a fixed energy
slice $H = E$ of) the phase space $\Omega$, assumed to have a finite volume
$V = \mu(\Omega)$ normalized to $V = 1$, where $\mu$ is the Liouville measure. We denote by $T$
the volume preserving map $\Omega \to \Omega$ corresponding to the integration of the
Hamiltonian flow during some reference time $t_0$. $T^k$ is the iterated map (evolution during
time $t = k t_0$). This discrete time dynamical mapping given by $T$ is said to have the weak
mixing property if for any two (measurable) subsets $A$ and $B$ of $\Omega$ one has

$$\lim_{n\to\infty} \frac{1}{n}\sum_{k=0}^{n-1} \mu(B\cap T^k A) = \mu(B)\,\mu(A) \qquad (5.4.14)$$

The (weak) mixing property means (roughly speaking) that, if we take a random point $a$ in
phase space, its iterates $a_k = T^k a$ are at large time and "on the average" uniformly
distributed on phase space, with a probability $\mu(B)/\mu(\Omega)$ to be contained inside any
subset $B \subset \Omega$. See fig. 5.7.

Weak mixing is one of the weakest forms of "ergodicity" (in a loose sense; there is a precise
mathematical concept of ergodicity).
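The averaged relation (5.4.14) can be tested numerically on a standard example. The sketch below uses Arnold's cat map on the unit torus as a stand-in for the map $T$; this choice, the two rectangles `A` and `B`, and the sample sizes are assumptions of mine, not taken from the text. Since $T$ is invertible and measure preserving, $\mu(B\cap T^k A)$ equals $\mu(A)$ times the probability that a uniformly chosen point of $A$ lands in $B$ after $k$ steps, which is what the code estimates.

```python
import random

def cat_map(x, y):
    """Arnold cat map on the unit torus, an area-preserving mixing map."""
    return (x + y) % 1.0, (x + 2.0 * y) % 1.0

def in_b(x, y):
    """Indicator of B = [0.5, 1) x [0, 1), with mu(B) = 0.5."""
    return x >= 0.5

def cesaro_average(n_steps, n_points, rng):
    """Estimate (1/n) * sum_{k<n} mu(B and T^k A) for A = [0, 0.5) x [0, 0.5)."""
    hits = 0
    for _ in range(n_points):
        x, y = 0.5 * rng.random(), 0.5 * rng.random()   # uniform point of A
        for _ in range(n_steps):
            hits += in_b(x, y)          # is T^k a inside B?
            x, y = cat_map(x, y)
    # fraction of (point, time) pairs landing in B, weighted by mu(A) = 0.25
    return 0.25 * hits / (n_points * n_steps)

estimate = cesaro_average(40, 5000, random.Random(2))
```

The value of `estimate` should come out close to $\mu(A)\mu(B) = 0.125$; the small residual deviation comes from the finite number of steps (the $k = 0$ term vanishes because `A` and `B` are disjoint) and from sampling noise.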
Figure 5.7: Graphical representation of the mixing property (very crude)



Now in semiclassical quantization (for instance using Bohr-Sommerfeld quantization rules),
if a classical system has $M$ independent degrees of freedom (hence its classical phase space
$\Omega$ has dimension $2M$), the "quantum element of phase space" $\delta\Omega$ has volume
$\delta V = \mu(\delta\Omega) = h^M$, with $h = 2\pi\hbar$ the Planck constant. If the phase
space is compact with volume $\mu(\Omega) < \infty$, the number of "independent quantum states"
accessible to the system is of order $N = \mu(\Omega)/\mu(\delta\Omega)$ and should correspond
to the dimension of the Hilbert space, $N = \dim(\mathcal{H})$. In this crude semiclassical
picture, if we consider two pure quantum states $|a\rangle$ and $|b\rangle$ and associate to
them two minimal semiclassical subsets $A$ and $B$ of the semiclassical phase space $\Omega$, of
quantum volume $\delta V$, the semiclassical volume $\mu(A\cap B)$ corresponds to the overlap
between the two quantum pure states through



$$\frac{\mu(A\cap B)}{\mu(\delta\Omega)} \simeq |\langle a|b\rangle|^2 \qquad (5.4.15)$$



More generally, if we associate to any (non-minimal) subset $A$ of $\Omega$ a mixed state given
by a quantum density matrix $\rho_A$, we have the semiclassical correspondence

$$\frac{\mu(A\cap B)}{\mu(A)\,\mu(B)} \simeq N\,\mathrm{tr}(\rho_A\,\rho_B) \qquad (5.4.16)$$



With this semiclassical picture in mind (Warning! It does not work for all states, only for
states which have a semiclassical interpretation! But pointer states usually do.) the
measurement/interaction process discussed above has a simple semiclassical interpretation,
illustrated in fig. 5.8.

The big system $\mathcal{M}$ starts from an initial state $|I\rangle$ described by a
semiclassical element $I$. If the system $S$ is in the state $|\uparrow\rangle$, $\mathcal{M}$
evolves to a state $|F_+\rangle$ corresponding to $F_+$. If it is in the state
$|\downarrow\rangle$, $\mathcal{M}$ evolves to a state $|F_-\rangle$ corresponding to $F_-$. For
well chosen, but quite generic, Hamiltonians $H_+$ and $H_-$, the dynamics is mixing, so that,
while $\mu(F_+) = \mu(F_-) = 1/N$, typically one has
$\mu(F_+\cap F_-) = \mu(F_+)\,\mu(F_-) = 1/N^2 \ll 1/N$. Thus it is enough for the quantum
dynamics generated by $H_+$ and $H_-$ to have a quantum analog of the classical property of
mixing, which is quite generic, to "explain" why the two final states $|F_+\rangle$ and
$|F_-\rangle$ are generically (almost) orthogonal.
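The estimate can be written out in one line. This is only a sketch, under the assumption that the semiclassical correspondence for minimal subsets is read as overlap $\simeq \mu(A\cap B)/\mu(\delta\Omega)$, with the normalization $\mu(\Omega) = 1$ so that $\mu(\delta\Omega) = 1/N$:

```latex
|\langle F_+|F_-\rangle|^2
  \simeq \frac{\mu(F_+\cap F_-)}{\mu(\delta\Omega)}
  = N\,\mu(F_+)\,\mu(F_-)
  = N\cdot\frac{1}{N^2}
  = \frac{1}{N} \ll 1 .
```

Under that reading, the semiclassical mixing argument gives the same order of magnitude as the random-state estimate (5.4.12), with $N = \dim(\mathcal{H}_\mathcal{M})$.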



5.4.4 Discussion 

As already stated, the points that I tried to discuss in this section represent only a small
subset of the questions about measurements in quantum mechanics. Again, I refer for instance
to [ABN12] and [Lal12] (among many other reviews) for a serious discussion and bibliography.

Figure 5.8: Crude semiclassical and quantum pictures of the decoherence process (5.4.10)-(5.4.12)



I have not discussed more realistic measurement processes, in particular the so-called
"indirect measurement procedures", or "weak measurements", where the observations on the
system are performed through successive interactions with other quantum systems (the probes),
which are devised so as to perturb the observed system as little as possible, followed by
stronger direct (in general destructive) measurements of the state of the probes. Such
measurement processes, as well as many interesting questions and experiments in quantum physics,
quantum information science, etc., are described by the general formalism of POVMs (Positive
Operator Valued Measures). I do not discuss these questions here.

In any case, important aspects of quantum measurements belong to the general class of
problems of the emergence and the meaning of irreversibility out of reversible microscopic
laws in physics (quantum as well as classical). See for instance [HPMZ96].

The quantum formalism as it is presented in these lectures starts (amongst other things)
from explicit assumptions on the properties of measurements. The best one can hope for is to
show that the quantum formalism is consistent: the characteristics and behavior of (highly
idealized) physical measurement devices, constructed and operated according to the laws of
quantum mechanics, should be consistent with the initial axioms.

One must be careful however, when trying to justify or rederive some of the axioms of
quantum mechanics from the behavior of measurement apparatuses and their interactions with
the observed system and the rest of the world, not to fall into circular reasoning.



5.5 Interpretations versus Alternative Theories 

In these notes I have been careful not to discuss the interpretation issues of quantum me- 
chanics. There are at least two reasons. 

1. These notes are focused on the mathematical formalism of "standard quantum mechanics".
Thus I adopt the "operational" point of view^5 that quantum mechanics is a theoretical
framework which provides rules to compute the probabilities of obtaining a given
result when measuring some observable of a system in a given state. The concepts of
"observables", "states" and "probabilities" are defined through the principles (axioms
in a non-mathematical sense) of the formalisms considered.

2. I do not feel qualified enough to discuss all the interpretations that have been proposed
and all the philosophical questions raised by quantum physics since its birth. This does
not mean that these questions are unimportant.

5. This is probably the point of view adopted by most physicists, chemists, mathematicians,
computer scientists, engineers, ... who deal with the quantum world.

However, let me just make a few simple, and probably naive, remarks. 

Many interpretations of quantum mechanics do not challenge the present standard mathematical
formulations of the theory. They rather insist on a particular point of view or a particular
formulation of quantum mechanics as the best suited or the preferable one to consider
and study quantum systems and the quantum world. They may be considered as particular
choices^6 of point of view and of philosophical option to think about quantum mechanics and
practice it.

This is clearly the case for the so-called Copenhagen interpretations. They insist on the fact
that QM deals only with predictions for results of operations, and they can be considered as
"quantum mechanics from a strong pragmatist^7 point of view". Remember however that there
is no clear-cut definition of what a Copenhagen interpretation is. The term was introduced
only in 1955 by Heisenberg. I refer to the paper by Howard [How04] for a historical and
critical review of the history, uses and misuses of the concept. This is also the case for the
"many worlds interpretations", which try to take seriously the concept of "wave function of the
universe". They can be considered (when used reasonably for physics) as the other extreme,
"quantum mechanics from a strong realist^8 point of view". Again there are many variants of
these kinds of interpretations. I refer to [DG73] for the original papers, and to [SBKW10] for a
recent presentation of the subject and contradictory discussions.

There is a whole spectrum of proposed interpretations, for instance the "coherent history
formulations" and the "modal interpretations". I do not discuss these interpretations here.

The interpretations that rely on the mathematical formulations of quantum mechanics should
be clearly distinguished^9 from another class of proposals to explain quantum physics that rely
on modifications of the rules and are different physical theories. These modified or alternative
quantum theories deviate from "standard" quantum mechanics and should be experimentally
falsifiable (and sometimes are already falsified).

This is the case of the various non-local hidden-variables proposals, such as the de Broglie-
Bohm theory, which contain some variables (degrees of freedom) which do not obey the laws
of QM, and which cannot be observed directly. One might think that they are not falsifiable,
but remember that there are serious problems coming from contextuality, which means that in
general, if one wants to keep non-contextuality, not all physical observables (i.e. those that
can be measured) are expected to behave as QM predicts.

This is also the case for the class of models known as "collapse models". See [GRW85,
GRW86] for the first models. In these models the quantum dynamics is modified (for instance
by non-linear terms) so that the evolution of the wave function is not unitary any more (while
the probabilities are of course conserved), and the "collapse of the wave function" is a
dynamical phenomenon. These models are somewhat phenomenological and of course not (yet?) fully
internally consistent, since the origin of these non-linear dynamics is quite ad hoc. They
predict a breakdown of the laws of QM for the evolution of quantum coherences and for
decoherence phenomena at large times, large distances, or in particular for big quantum systems
(for instance large molecules or atomic clusters). At the present day, despite the impressive
experimental progress in the control of quantum coherences, quantum measurements, the study of
decoherence phenomena, the manipulation of information in quantum systems, etc., no such
violations of the predictions of standard QM and of unitary dynamics have been observed.

6. This does not mean that I am an adept of some post-modern relativism...

7. In the philosophical sense of pragmatism.

8. In the philosophical sense of realism.

9. This is unfortunately not always the case in popular - and even in some advanced -
presentations and discussions of quantum physics.

5.6 What about gravity? 

Another really big subject that I do not discuss in these lectures is quantum gravity. Again,
just a few trivial remarks.

It is clear that the principles of quantum mechanics are challenged by the question of
quantizing gravity. The challenges are not only technical. General relativity (GR) is indeed a
non-renormalizable theory, and from that point of view a first and natural idea is to consider it
as an effective low energy theory. After all, in the development of nuclear and particle physics
(in the 1930s, the 1940s, the 1960s...) there have been several theoretical false alarms and
clashes between experimental discoveries and the theoretical understanding that led many great
minds to question the principles of quantum mechanics. However QM came out unscathed and even
stronger, and since the 1970s its principles have not been challenged any more.

However, with gravity the situation is different. For instance the discovery of the
Bekenstein-Hawking entropy of black holes, of the Hawking radiation, and of the "information
paradox" shows that fundamental questions remain to be understood about the relation between
quantum mechanics and the GR concepts of space and time. Indeed even the most advanced quantum
theories available, quantum field theories such as non-abelian gauge theories, the standard
model, and its supersymmetric and/or grand unified extensions, still rely on the special
relativity concept of space-time, or to some extent on the dynamical but still classical concept
of curved space-time of GR. It is clear that a quantum theory of space-time will deeply modify,
and even abolish, the classical concept of space-time as we are used to it. One should note two
things.

Firstly, the presently most advanced attempts to build a quantum theory incorporating
gravity, namely string theory and its modern extensions, as well as the alternative approaches
to build a quantum theory of space-time such as loop quantum gravity (LQG) and spin-foam
models (SF), rely mostly on the quantum formalism as we know it, but change the fundamental
degrees of freedom (drastically and quite widely for string theories, in a more conservative way
for LQG/SF). The fact that string theories offer some serious hints of solutions of the
information paradox, and some explicit solutions and ideas, like holography and AdS/CFT
dualities, for viewing space-time as emergent, is a very encouraging fact.

Secondly, in the two formalisms presented here, the algebraic formalism and the quantum
logic formulations, it should be noted that space and time (as continuous entities) play a
secondary role with respect to the concepts of causality and locality/separability. I hope this
is clear in the way I chose to present the algebraic formalism in section 3 and quantum logic in
section 4. Of course space and time are essential for constructing physical theories out of the
formalism. Nevertheless, the fact that it is causal relations and causal independence between
physical measurement operations that are essential for the formulation of the theory is also a
very encouraging fact.

Nevertheless, if for instance the information paradox is not solved by a quantum theory of
gravity, or if the concepts of causality and separability have to be rejected (for instance if no
repeatable measurements are possible, and if no two sub-systems/sub-ensembles-of-degrees-
of-freedom can be considered as really separated/independent), then one might expect that the
basic principles of quantum mechanics will not survive (and, according to the common lore,
should be replaced by something even more bizarre and inexplicable...).

Well! It is time to end this bar room discussion.